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COMPOSITIONS AND METHODS FOR THE DIAGNOSIS AND TREATMENT 
OF HERPES SIMPLEX VIRUS INFECTION 

TECHNICAL FIELD OF THE INVENTION 

The present invention relates generally to the detection and treatment of 
HSV infection. In particular, the invention relates to polypeptides comprising HSV 
antigens, DNA encoding HSV antigens, and the use of such compositions for the 
diagnosis and treatment of HSV infection. 

BACKGROUND OF THE INVENTION 

The herpes viruses include the herpes simplex viruses (HSV), 
comprising two closely related variants designated types 1 (HSV-1) and 2 (HSV-2). 
HSV is a prevalent cause of genital infection in humans, with an estimated annual 
incidence of 600,000 new cases and with 10 to 20 million individuals experiencing 
symptomatic chronic recurrent disease. The asymptomatic subclinical infection rate may 
be even higher. For example, using a type-specific serological assay, 35% of an 
unselected population of women attending a health maintenance organization clinic in 
Atlanta had antibodies to HSV type 2 (HSV-2). Although continuous administration of 
antiviral drugs such as acyclovir ameliorates the severity of acute HSV disease and 
reduces the frequency and duration of recurrent episodes, such chemotherapeutic 
intervention does not abort the establishment of latency nor does it alter the status of the 
latent virus. As a consequence, the recurrent disease pattern is rapidly reestablished 
upon cessation of drug treatment. 

The genome of at least one strain of herpes simplex virus (HSV) has 
been characterized. It is approximately 150 kb and encodes about 85 known genes, 
each of which encodes a protein in the range of 50-1000 amino acids in length. 
Unknown, however, are the immunogenic portions, particularly immunogenic epitopes, 
that are capable of eliciting an effective T cell immune response to viral infection. 

Thus, it is a matter of great medical and scientific need to identify 
immunogenic portions, preferably epitopes, of HSV polypeptides that are capable of 
eliciting an effective immune response to HSV infection. Such information wDl lead to 
safer and more effective prophylactic pharmaceutical compositions, e.g., vaccine 
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compositions, to substantially prevent HSV infections, and, where infection has already 
occurred, therapeutic compositions to combat the disease. The present invention fulfills 
these and other needs. 

SUMMARY OF THE INVENTION 

The present invention provides compositions and methods for the 
diagnosis and therapy of HSV infection. In one aspect, the present invention provides 
polypeptides comprising an immunogenic portion of a HSV antigen, or a variant or 
biological functional equivalent of such an antigen. Certain preferred portions and other 
variants are immunogenic, such that the ability of the portion or variant to react with 
antigen-specific antisera is not substantially diminished. Within certain embodiments, 
the polypeptide comprises an amino acid sequence encoded by a polynucleotide 
sequence selected from the group consisting of (a) a sequence of any one of SEQ ID 
NO: 1, 4, 8-9, 13, 16, 19 24, 35-38, 48-49, and 52-53; (b) a complement of said 
sequence; and (c) sequences that hybridize to a sequence of (a) or (b) under moderately 
stringent conditions. In specific embodiments, the polypeptides of the present invention 
comprise at least a portion, preferably at least an immunogenic portion, of a HSV 
protein that comprises some or all of an amino acid sequence recited in any one of SEQ 
ID NO: 2, 3, 5, 6, 7, 10-12, 14-15, 17-18, 20-23, 25-33, 39-47, 50-51, and 54-64, 
including variants and biological functional equivalents thereof. 

The present invention further provides polynucleotides that encode a 
polypeptide as described above, or a portion thereof (such as a portion encoding at least 
15 contiguous amino acid residues of a HSV protein), expression vectors comprising 
such polynucleotides and host cells transformed or transfected with such expression 
vectors. 

In a related aspect, polynucleotide sequences encoding the above 
polypeptides, recombinant expression vectors comprising one or more of these 
polynucleotide sequences and host cells transformed or transfected with such expression 
vectors are also provided. 

In another aspect, . the present invention provides fusion proteins 
comprising one or more HSV polypeptides, for example in combination with a 
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physiologically acceptable carrier or immunostimulant for use as pharmaceutical 
compositions and vaccines thereof. 

The present invention further provides pharmaceutical compositions that 
comprise: (a) an antibody, either polyclonal and monoclonal, or antigen-binding 
fragment thereof that specifically binds to a HSV. protein; and (b) a physiologically 
acceptable carrier. 

Within other aspects, the present invention provides pharmaceutical 
compositions that comprise one or more HSV polypeptides or portions thereof disclosed 
herein, or a polynucleotide molecule encoding such a polypeptide, and a physiologically 
acceptable carrier. The invention also provides vaccines for prophylactic and 
therapeutic purposes comprising one or more of the disclosed polypeptides and an 
immunostimulant, as defined herein, as well as vaccines comprising one or more 
polynucleotide sequences encoding such polypeptides and an immunostimulant. 

In yet another aspect, methods are provided for inducing protective 
immunity in a patient, comprising administering to a patient an effective amount of one 
or more of the above pharmaceutical compositions or vaccines. Any of the polypeptides 
identified for use in the treatment of patients can be used in conjunction with 
pharmaceutical agents used to, treat herpes infections, such as, but not limited to, 
Zovirax®(AcycIovir), Valtrex® (Valacyclovir), and Famvir® (Famcyclovir). 

In yet a further aspect, there are provided methods for treating, 
substantially preventing or otherwise ameliorating the effects of an HSV infection in a 
patient, the methods comprising obtaining peripheral blood mononuclear cells (PBMC) 
from the patient, incubating the PBMC with a polypeptide of the present invention (or a 
polynucleotide that encodes such a polypeptide) to provide incubated T cells and 
administering the incubated T cells to the patient. The present invention additionally 
provides methods for the treatment of HSV infection that comprise incubating antigen 
presenting cells with a polypeptide of the present invention (or a polynucleotide that 
encodes such a polypeptide) to provide incubated antigen presenting cells and 
administering the incubated antigen presenting cells to the patient. Proliferated cells 
may, but need not, be cloned prior to administration to the patient. In certain 
embodiments, the antigen presenting cells are selected from the group consisting of 
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dendritic cells, macrophages, monocytes, B-cells, and fibroblasts. Compositions for the 
treatment of HSV infection comprising T cells or antigen presenting cells that have been 
incubated with a polypeptide or polynucleotide of the present invention are also 
provided. Within related aspects, vaccines are provided that comprise: (a) an antigen 
presenting cell that expresses a polypeptide as described above and (b) an 
immunostimulant. 

The present invention further provides, within other aspects, methods for 
removing HSV-infected cells from a biological sample, comprising contacting a 
biological sample with T cells that specifically react with a HSV protein, wherein the 
step of contacting is performed under conditions and for a time sufficient to permit the 
removal of cells expressing the protein from the sample. 

Within related aspects, methods are provided for inhibiting the 
development of HSV infection in a patient, comprising administering to a patient a 
biological sample treated as described above. In further aspects of the subject 
invention, methods and diagnostic kits are provided for detecting HSV infection in a 
patient. In one embodiment, the method comprises: (a) contacting a biological sample 
with at least one of the polypeptides or fusion proteins disclosed herein; and (b) 
detecting in the sample the presence of binding agents that bind to the polypeptide or 
fusion protein, thereby detecting HSV infection in the biological sample. Suitable 
biological samples include whole blood, sputum, serum, plasma, saliva, cerebrospinal 
fluid and urine. In one embodiment, the diagnostic kits comprise one or more of the 
polypeptides or fusion proteins disclosed herein in combination with a detection 
reagent. In yet another embodiment, the diagnostic kits comprise either a monoclonal 
antibody or a polyclonal antibody that binds with a polypeptide of the present invention. 

The present invention also provides methods for detecting HSV infection 
comprising: (a) obtaining a biological sample from a patient; (b) contacting the sample 
with at least two oligonucleotide primers in a polymerase chain reaction, at least one of 
the oligonucleotide primers being specific for a polynucleotide sequence disclosed 
herein; and (c) detecting in the sample a polynucleotide sequence that amplifies in the 
presence of the oligonucleotide primers. In one embodiment, the oligonucleotide 
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primer comprises at about 10 contiguous nucleotides of a polynucleotide sequence 
peptide disclosed herein, or of a sequence that hybridizes thereto. 

In a frirther aspect, the present invention provides a method for detecting 
HSV infection in a patient comprising: (a) obtaining a biological sample from the 
patient; (b) contacting the sample with an oligonucleotide probe specific for a 
polynucleotide sequence disclosed herein; and (c) detecting in the sample a 
polynucleotide sequence that hybridizes to the oligonucleotide probe. In one 
embodiment, the oligonucleotide probe comprises at least about 15 contiguous 
nucleotides of a polynucleotide sequence disclosed herein, or a sequence that hybridizes 
thereto. 

These and other aspects of the present invention will become apparent 
upon reference to the following detailed description. All references disclosed herein are 
hereby incorporated by reference in their entirety as if each was incorporated 
individually. 

SEQUENCE IDENTIFIERS 

SEQ ID NO: 1 sets forth a polynucleotide sequence of an isolated clone 
designated HSV2IJJL39fragH12A12; 

SEQ ID NO: 2 sets forth an amino acid sequence, designated 
H12A12orfl.pro, of an open reading frame encoded within the polynucleotide of SEQ 
ED NO: 1; 

SEQ ID NO: 3 sets forth the amino acid sequence of the full length 
HSV-2 UL39 protein; 

SEQ ED NO: 4 sets forth a polynucleotide sequence of an isolated clone 
designated HSV2njJS8AfragD6.B_Bl 1 JT7Trc.seq; 

SEQ ID NO: 5 sets forth an amino acid sequence, designated 
D6Borfl .pro, of an open reading frame encoded within the polynucleotide of SEQ ID 
NO: 4; 

SEQ ID NO: 6 sets forth an amino acid sequence, designated 
D6Borf2.pro, of an open reading frame encoded within the polynucleotide of SEQ ID 
NO: 4; 
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SEQ ID NO: 7 sets forth the amino acid sequence of the full length 
HSV-2 US 8 A protein; 

SEQ ID NO: 8 sets forth a polynucleotide sequence of an isolated clone 
designated HSV2n_US4fragF10B3_T7Trc.seq; 

SEQ ID NO: 9 sets forth a polynucleotide sequence of an isolated clone 
designated HSV2njUS3fragF10B3 JT7P.seq; 

SEQ ID NO: 10 sets forth an amino acid sequence, designated 
F10B3orf2.pro, of an open reading frame encoded within the polynucleotide of SEQ ID 
NO: 8; 

SEQ ID NO: 1 1 sets forth an amino acid sequence, designated 
8F1 0B3orfl .pro, of an open reading frame encoded within the polynucleotide of SEQ 
ID NO: 9; 

SEQ ID NO: 12 sets forth the amino acid sequence of the full length 
HSV-2 US3 protein; 

SEQ ID NO: 13 sets forth a polynucleotide sequence of an isolated 
clone designated HSV2n_UL46fragFl lF5_T7Trc.seq 

SEQ ID NO: 14 sets forth an amino acid sequence, designated 
Fl lF5orfl .pro, of an open reading frame encoded within the polynucleotide of SEQ ID 
NO: 13; 

SEQ ID NO: 15 sets forth the amino acid sequence of the full length 
HSV-2 UL46 protein; 

SEQ ID NO: 16 sets forth a polynucleotide sequence of an isolated clone 
designated HSV2n_UL27fragH2C7_T7Trc.seq 

SEQ ID NO: 17 sets forth an amino acid sequence, designated 
H2C7orfl .pro, of an open reading frame encoded within the polynucleotide of SEQ ID 
NO: 16; 

SEQ ID NO: 1 8 sets forth the amino acid sequence of the full length 
HSV-2 UL27 protein; 

SEQ ID NO: 19 sets forth a polynucleotide sequence of an isolated clone 
designated HSV2H_UL18fragF10Al_rc.seq; 
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SEQ ID NO: 20 sets forth an amino acid sequence, designated 
Fl OAlorO.pro, of an open reading frame encoded within the polynucleotide of SEQ ID 
NO: 19; 

SEQ ID NO: 21 sets forth an amino acid sequence, designated 
F10Alorf2.pro, of an open reading frame encoded within the polynucleotide of SEQ ID 
NO: 19; 

SEQ ID NO: 22 sets forth an amino acid sequence, designated 
Fl 0A1 orfl .pro, of an open reading frame encoded within the polynucleotide of SEQ ID 
NO: 19; 

SEQ ID NO: 23 sets forth the amino acid sequence of the full length 
HSV-2UL1 8 protein; 

SEQ ID NO: 24 sets forth a polynucleotide sequence of an isolated clone 
designated HSV2njJL15fragF10A12_rc.seq; 

SEQ ID NO: 25 sets forth an amino acid sequence, designated 
F10A12orfl .pro, of an open reading frame encoded within the polynucleotide of SEQ 
ID NO: 24; 

SEQ ID NO: 26 sets forth the amino acid sequence of the full length 
HSV-2UL1 5 protein; 

SEQ ID NO:27 sets forth the amino acid sequence of a 15-mer 
polypeptide derived from an immunogenic portion of the HSVII UL46 gene; 

SEQ ID NO:28 sets forth the amino acid sequence of a 15-mer poly- 
eptide derived from an immunogenic portion of the HSVII UL46 gene; 

SEQ ID NO:29 sets forth the amino acid sequence of a 15-mer 
polypeptide derived from an immunogenic portion of the HSVII UL46 gene; 

SEQ ID NO:30 sets forth the amino acid sequence of a 15-mer 
polypeptide derived from an immunogenic portion of the HSVII UL46 gene; 

SEQ ID NO:3 1 sets forth the amino acid sequence of a 15-mer 
polypeptide derived from an immunogenic portion of the HSVII UL46 gene; 

SEQ ID NO:32 sets forth the amino acid sequence of a 1 5-mer 
polypeptide derived from an immunogenic portion of the HSVII UL1 8 gene; 
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SEQ ID NO:33 sets forth the amino acid sequence of a 1 5-mer 
polypeptide derived from an immunogenic portion of the HSVII UL18 gene; 

SEQ ID NO:34 sets forth a nucleotide sequence of an isolated clone 
designated RL2_E9A4_5_consensus.seq; 

SEQ ID NO:35 sets forth the nucleotide sequence of the full length 
HSV-2 RL2 gene; 

SEQ ID NO:36 sets for the nucleotide sequence of an isolated clone 
designated UL23_22_C 1 2 A 1 2_consensus.seq; 

SEQ ID NO:37 sets forth the nucleotide sequence of the full length 
HSV-2 UL23 protein; 

SEQ ID NO:38 sets forth the nucleotide sequence of the full length 
HSV-2 UL22 protein; 

SEQ ID NO:39 sets forth an amino acid sequence, designated 
HSV2JUL23, of an open reading frame encoded by the polynucleotide of SEQ ID NO: 
37; 

SEQ ID NO:40 sets forth an amino acid sequence designated 
HSV2_UL23 of an open reading frame encoded within the polynucleotides of SEQ ID 
NO:36; 

SEQ ID NO:41 sets forth an amino acid sequence designated 
HSV2_UL22 of an open reading frame encoded within the polynucleotides of SEQ ID 
NO:36; 

SEQ ID NO:42 sets forth the amino acid sequence of a 1 5-mer 
polypeptide derived from an immunogenic portion of the HSVII UL23 gene; 

SEQ ID NO:43 sets forth the amino acid sequence of a 1 5-mer 
polypeptide derived from an immunogenic portion of the HSVII UL23 gene; 

SEQ ID NO:44 sets forth the amino acid sequence of a 1 5-mer 
polypeptide derived from an immunogenic portion of the HSVII UL23 gene; 

SEQ ID NO:45 sets forth an amino acid sequence, designated 
HSV2 JLJL22, of an open reading frame encoded by the polynucleotide of SEQ ID 
NO:38; 
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SEQ ID NO:46 sets forth an amino acid sequence, designated 
RL2_E9A4_5_consensus.seq, of an open reading frame encoded by the polynucleotide 
of SEQIDNO:34; 

SEQ ID NO:47 sets forth an amino acid sequence, designated 
HSV2_RL2, of an open reading frame encoded by the polynucleotide of SEQ ID 
NO:35; 

SEQ ID NO:48 sets forth a nucleotide sequence of an isolated clone 
designated G10_UL37consensus.seq; 

SEQ ID NO:49 sets forth the nucleotide sequence of the full length 
HSV-2 UL37 gene; 

SEQ ID NO:50 sets forth an amino acid sequence, designated 
HSV2JUL37, of an open reading frame encoded by the polynucleotide of SEQ ID 
NO:48; and 

SEQ ID NO:5 1 sets forth an amino acid sequence, designated 
HSV2JJL37, of an open reading frame encoded by the polynucleotide of SEQ ID 
NO:49; 

SEQ ID NO:52 sets forth the DNA sequence derived from the insert of 
clone UL46fragFllF5; 

SEQ H) NO:53 sets forth the DNA sequence derived from the insert of 

clone G10; 

SEQ ID NO:54 sets forth the amino acid sequence derived from the 
insert of clone UL46fragFl 1F5; 

SEQ ID NO:55 sets forth the amino acid sequence derived from the 
insert of clone G10; 

SEQ ID NO:56 is amino acid sequence of peptide #23 (amino acids 688- 
702) of the HSV-2 gene UL15; 

SEQ ID NO:57 is amino acid sequence of peptide #30 (amino acids 716- 
730) of the HSV-2 gene UL15; 

SEQ ID NO:58 is amino acid sequence of peptide #7 (amino acids 265- 
279) of the HSV-2 gene UL23; 
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SEQ ID NO:59 is amino acid sequence of peptide #2 (amino acids 621- 
635) of the HSV-2 gene UL46; 

SEQ ID NO:60 is amino acid sequence of peptide #8 (amino acids 645- 
659) of the HSV-2 gene UL46; 

SEQ ID NO:61 is amino acid sequence of peptide #9 (amino acids 649- 
663) of the HSV-2 gene UL46; 

SEQ ID NO:62 is amino acid sequence of peptide #1 1 (amino acids 657- 
671) of the HSV-2 gene UL46; 

SEQ ID NO:63 is amino acid sequence of peptide #33 (amino acids 262- 
276) of the HSV-2 gene US3; 

SEQ ID NO:64 is amino acid sequence of peptide #5 (amino acids 99- 
113) of the HSV-2 gene US8A. 

DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS 

As noted above, the present invention is generally directed to 
compositions and methods for making and using the compositions, particularly in the 
therapy and diagnosis of HSV infection. Certain illustrative compositions described 
herein include HSV polypeptides, polynucleotides encoding such polypeptides, binding 
agents such as antibodies, antigen presenting cells (APCs) and/or immune system cells 
(e.g., T cells). Certain HSV proteins and immunogenic portions thereof comprise HSV 
polypeptides that react detectably (within an immunoassay, such as an ELISA or 
Western blot) with antisera of a patient infected with HSV. 

Therefore, the present invention provides illustrative polynucleotide 
compositions, illustrative .polypeptide compositions, immunogenic portions of said 
polynucleotide and polypeptide compositions, antibody compositions capable of 
binding such polypeptides, and numerous additional embodiments employing such 
compositions, for example in the detection, diagnosis and/or therapy of human HSV 
infections. 

Polynucleotide Compositions 

As used herein, the terms "DNA segment" and "polynucleotide" refer to 
a DNA molecule that has been isolated free of total genomic DNA of a particular 
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species. Therefore, a DNA segment encoding a polypeptide refers to a DNA segment 
that contains one or more coding sequences yet is substantially isolated away from, or 
purified free from, total genomic DNA of the species from which the DNA segment is 
obtained. Included within the terms "DNA segment" and "polynucleotide" are DNA 
segments and smaller fragments of such segments, and also recombinant vectors, 
including, for example, plasmids, cosmids, phagemids, phage, viruses, and the like. 

As will be understood by those skilled in the art, the DNA segments of 
this invention can include genomic sequences, extra-genomic and plasmid-encoded 
sequences and smaller engineered gene segments that express, or may be adapted to 
express, proteins, polypeptides, peptides and the like. Such segments may be naturally 
isolated, or modified synthetically by the hand of man. 

"Isolated," as used herein, means that a polynucleotide is substantially 
away from other coding sequences, and that the DNA segment does not contain large 
portions of unrelated coding DNA, such as large chromosomal fragments or other 
functional genes or polypeptide coding regions. Of course, this refers to the DNA 
segment as originally isolated, and does not exclude genes or coding regions later added 
to the segment by the hand of man. 

As will be recognized by the skilled artisan, polynucleotides may be 
single-stranded (coding or antisense) or double-stranded, and may be DNA (genomic, 
cDNA or synthetic) or RNA molecules. RNA molecules include HnRNA molecules, 
which contain introns and correspond to a DNA molecule in a one-to-one manner, and 
mRNA molecules, which do not contain introns. Additional coding or non-coding 
sequences may, but need not, be present within a polynucleotide of the present 
invention, and a polynucleotide may, but need not, be linked to other molecules and/or 
support materials. 

Polynucleotides may comprise a native sequence (i.e., an endogenous 
sequence that encodes an HSV protein or a portion thereof) or may comprise a variant, 
or a biological or antigenic functional equivalent of such a sequence. Polynucleotide 
variants may contain one or more substitutions, additions, deletions and/or insertions, as 
further described below, preferably such that the immunogenicity of the encoded 
polypeptide is not diminished, relative to a native HSV protein. The effect on the 
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immunogenicity of the encoded polypeptide may generally be assessed as described 
herein. The term "variants" also encompasses homologous genes of xenogenic origin. 

When comparing polynucleotide or polypeptide sequences, two 
sequences are said to be "identical" if the sequence of nucleotides or amino acids in the 
two sequences is the same when aligned for maximum correspondence, as described 
below. Comparisons between two sequences are typically performed by comparing the 
sequences over a comparison window to identify and compare local regions of sequence 
similarity. A "comparison window" as used herein, refers to a segment of at least about 
20 contiguous positions, usually 30 to about 75, 40 to about 50, in which a sequence 
may be compared to a reference sequence of the same number of contiguous positions 
after the two sequences are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using 
the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, 
Inc., Madison, WI), using default parameters. This program embodies several 
alignment schemes described in the following references: Dayhoff, M.O. (1978) A 
model of evolutionary change in proteins - Matrices for detecting distant relationships. 
In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) 
Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology 
vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:11-17; Robinson, 
E.D. (1971) Comb. Theor 11:105; Santou, N. Nes, M. (1987) Mol Biol Evoi 4:406- 
425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and 
Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, WJ. and 
Lipman, D.J. (1983) Proc. Natl Acad, Set USA 50:726-730. 

Alternatively, optimal alignment of sequences for comparison may be 
conducted by the local identity algorithm of Smith and Waterman (1981) Add. APL. 
Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 
Mol Biol 48:443, by the search for similarity methods of Pearson and Lipman (1988) 
Proc. Natl. Acad. Set USA 85: 2444, by computerized implementations of these 
. algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
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Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 
or by inspection. 

One preferred example of algorithms that are suitable for determining 
percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 
algorithms, which are described in Altschul et al. (1977) Nucl Acids Res. 25:3389-3402 
and Altschul et al. (1990) J. Mol Biol 215:403-410, respectively. BLAST and BLAST 
2.0 can be used, for example with the parameters described herein, to determine percent 
sequence identity for the polynucleotides and polypeptides of the invention. Software 
for performing BLAST analyses is publicly available through the National Center for 
Biotechnology Information. In one illustrative example, cumulative scores can be 
calculated using, for nucleotide sequences, the parameters M (reward score for a pair of 
matching residues; always >0) and N (penalty score for mismatching residues; always 
<0). For amino acid sequences, a scoring matrix can be used to calculate the cumulative 
score. Extension of the word hits in each direction are halted when: the cumulative 
alignment score falls off by the quantity X from its maximum achieved value; the 
cumulative score goes to zero or below, due to the accumulation of one or more 
negative-scoring residue alignments; or the end of either sequence is reached. The 
BLAST algorithm parameters W, T and X determine the sensitivity and speed of the 
alignment. The BLASTN program (for nucleotide sequences) uses as defaults a 
wordlength (W) of 1 1, and expectation (E) of 10, and the BLOSUM62 scoring matrix 
(see Henikoff and Henikoff (1989) Proc. Natl Acad Set USA 89:10915) alignments, 
(B) of 50, expectation (E) of 10, M=5, N=-4 and a comparison of both strands. 

Preferably, the "percentage of sequence identity" is determined by 
comparing two optimally aligned sequences over a window of comparison of at least 20 
positions, wherein the portion of the polynucleotide or polypeptide sequence in the 
comparison window may comprise additions or deletions (i.e., gaps) of 20 percent or 
less, usually 5 to 15 percent, or 10 to 12 percent, as compared to the reference 
sequences (which does not comprise additions or deletions) for optimal alignment of the 
two sequences. The percentage is calculated by determining the number of positions at 
which the identical nucleic acid bases or amino acid residue occurs in both sequences to 
yield the number of matched positions, dividing the number of matched positions by the 
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total number of positions in the reference sequence (i.e., the window size) and 
multiplying the results by 100 to yield the percentage of sequence identity. 

Therefore, the present invention encompasses polynucleotide and 
polypeptide sequences having substantial identity to the sequences disclosed herein, for 
example those comprising at least 50% sequence identity, preferably at least 55%, 60%, 
65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% or higher, sequence 
identity compared to a polynucleotide or polypeptide sequence of this invention using 
the methods described herein, (e.g., BLAST analysis using standard parameters, as 
described below). One skilled in this art will recognize that these values can be 
appropriately adjusted to determine corresponding identity of proteins encoded by two 
nucleotide sequences by taking into account codon degeneracy, amino acid similarity, 
reading frame positioning and the like. 

In additional embodiments, the present invention provides isolated 
polynucleotides and polypeptides comprising various lengths of contiguous stretches of 
sequence identical to or complementary to one or more of the sequences disclosed 
herein. For example, polynucleotides are provided by this invention that comprise at 
least about 15, 20, 30, 40, 50, 75, 100, 150, 200, 300, 400, 500 or 1000 or more 
contiguous nucleotides of one or more of the sequences disclosed herein as well as all 
intermediate lengths there between. It will be readily understood that "intermediate 
lengths", in this context, means any length between the quoted values, such as 16, 17, 
18, 19, etc.; 21, 22, 23, etc.; 30, 31, 32, etc.; 50, 51, 52, 53, etc; 100, 101, 102, 103, 
etc.; 150, 151, 152, 153, etc; including all integers through 200-500; 500-1,000, and the 
like. 

The. polynucleotides of the present invention, or fragments thereof, 
regardless of the length of the coding sequence itself, may be combined with other DNA 
sequences, such as promoters, polyadenylation signals, additional restriction enzyme 
sites, multiple cloning sites, other coding segments, and the like, such that their overall 
length may vary considerably. It is therefore contemplated that a nucleic acid fragment 
of almost any length may be employed, with the total length preferably being limited by 
the ease of preparation and use in the intended recombinant DNA protocol. For 
example, illustrative DNA segments with total lengths of about 10,000, about 5000, 
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about 3000, about 2,000, about 1,000, about 500, about 200, about 100, about 50 base 
pairs in length, and the like, (including all intermediate lengths) are contemplated to be 
useful in many implementations of this invention. 

In other embodiments, the present invention is directed to 
polynucleotides that are capable of hybridizing under moderately stringent conditions to 
a polynucleotide sequence provided herein, or a fragment thereof, or a complementary 
sequence thereof. Hybridization techniques are well known in the art of molecular 
biology. For purposes of illustration, suitable . moderately stringent conditions for 
testing the hybridization of a polynucleotide of this invention with other polynucleotides 
include prewashing in a solution of 5 X SSC, 0.5% SDS, L0 mM EDTA (pH 8.0); 
hybridizing at 50°C-65°C, 5 X SSC, overnight; followed by washing twice at 65°C for 
20 minutes with each of 2X, 0.5X and 0.2X SSC containing 0.1% SDS. 

Moreover, it will be appreciated by those of ordinary skill in the art that, 
as a result of the degeneracy of the genetic code, there are many nucleotide sequences 
that encode a polypeptide as described herein. Some of these polynucleotides bear 
minimal homology to the nucleotide sequence of any native gene. Nonetheless, 
polynucleotides that vary due to .differences in codon usage are specifically 
contemplated by the present invention. Further, alleles of the genes comprising the 
polynucleotide sequences provided herein are within the scope of the present invention. 
Alleles are endogenous genes that are altered as a result of one or more mutations, such 
as deletions, additions and/or substitutions of nucleotides. The resulting mRNA and 
protein may, but need not, have an altered structure or function. Alleles may be 
identified using standard techniques (such as hybridization, amplification and/or 
database sequence comparison). 

Probes and Primers 

In other embodiments of the present invention, the polynucleotide 
sequences provided herein can be advantageously used as probes or primers for nucleic 
acid hybridization. As such, it is contemplated that nucleic acid segments that comprise 
a sequence region of at least about 15 nucleotide long contiguous sequence that has the 
same sequence as, or is complementary to, a 15 nucleotide long contiguous sequence 
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disclosed herein will find particular utility. Longer contiguous identical or 
complementary sequences, e.g., those of about 20, 30, 40, 50, 100, 200, 500, 1000 
(including all intermediate lengths) and even up to full length sequences will also be of 
use in certain embodiments. 

The ability of such nucleic acid probes to specifically hybridize to a 
sequence of interest will enable them to be of use in detecting the presence of 
complementary sequences in a given sample. However, other, uses are also envisioned, 
such as the use of the sequence information for the preparation of mutant species 
primers, or primers for use in preparing other genetic constructions. 

Polynucleotide molecules having sequence regions consisting of 
contiguous nucleotide stretches of 10-14, 15-20, 30, 50, or even of 100-200 nucleotides 
or so (including intermediate lengths as well), identical or complementary to a 
polynucleotide sequence disclosed herein, are particularly contemplated as hybridization 
probes for use in, e.g., Southern and Northern blotting. This would allow a gene 
product, or fragment thereof, to be analyzed, both in diverse cell types and also in 
various bacterial cells. The total size of fragment, as well as the size of the 
complementary stretch(es), will ultimately depend on the intended use or application of 
the particular nucleic acid segment. Smaller fragments will generally find use in 
hybridization embodiments, wherein the length of the contiguous complementary region 
may be varied, such as between about 15 and about 100 nucleotides, but larger 
contiguous complementarity stretches may be used, according to the length 
complementary sequences one wishes to detect. 

The use of a hybridization probe of about 15-25 nucleotides in length 
allows the formation of a duplex molecule that is both stable and selective. Molecules 
having contiguous complementary sequences over stretches greater than 15 bases in 
length are generally preferred, though, in order to increase stability and selectivity of the 
hybrid, and thereby improve the quality and degree of specific hybrid molecules 
obtained. One will generally prefer to design nucleic acid molecules having gene- 
complementary stretches of 15 to 25 contiguous nucleotides, or even longer where 
desired. 
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Hybridization probes may be selected from any portion of any of the 
sequences disclosed herein. All that is required is to review the sequence set forth in 
the sequences disclosed herein, or to any continuous portion of the sequence, from 
about 15-25 nucleotides in length up to and including the full length sequence, that one 
wishes to utilize as a probe or primer. The choice of probe and primer sequences may 
be governed by various factors. For example, one may wish to employ primers from 
towards the termini of the total sequence. 

Small polynucleotide segments or fragments may be readily prepared by, 
for example, directly synthesizing the fragment by chemical means, as is commonly 
practiced using an automated oligonucleotide synthesizer. Also, fragments may be 
obtained by application of nucleic acid reproduction technology, such as the PCR™ 
technology of U. S. Patent 4,683,202 (incorporated herein by reference), by introducing 
selected sequences into recombinant vectors for recombinant production, and by other 
recombinant DNA techniques generally known to those of skill in the art of molecular 
biology. 

The nucleotide sequences of the invention may be used for their ability to 
selectively form duplex molecules with complementary stretches of the entire gene or 
gene fragments of interest. Depending on the application envisioned, one will typically 
desire to employ varying conditions of hybridization to achieve varying degrees of 
selectivity of probe towards target sequence. For applications requiring high selectivity, 
one will typically desire to employ relatively stringent conditions to form the hybrids, 
e.g., one will select relatively low salt and/or high temperature conditions, such as 
provided by a salt concentration of from about 0.02 M to about 0.15 M salt at 
temperatures of from about 50°C to about 70°C. Such selective conditions tolerate 
little, if any, mismatch between the probe and the template or target strand, and would 
be particularly suitable for isolating related sequences. 

Of course, for some applications, for example, where one desires to 
prepare mutants employing a mutant primer strand hybridized to an underlying 
template, less stringent (reduced stringency) hybridization conditions will typically be 
needed in order to allow formation of the heteroduplex. In these circumstances, one 
may desire to employ salt conditions such as those of from about 0.15 M to about 0.9 M 
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salt, at temperatures ranging from about 20°C to about 55°C. Cross-hybridizing species 
can thereby be readily identified as positively hybridizing signals with respect to control 
hybridizations. In any case, it is generally appreciated that conditions can be rendered 
more stringent by the addition of increasing amounts of formamide, which serves to 
destabilize the hybrid duplex in the same manner as increased temperature. Thus, 
hybridization conditions can be readily manipulated, and thus will generally be a 
method of choice depending on the desired results. 

Polynucleotide Identification and Characterization 

Polynucleotides may be identified, prepared and/or manipulated using 
any of a variety of well established techniques. For example, a polynucleotide may be 
identified, as described in more detail below, by screening a microarray of cDNAs for 
HSV-associated expression (i.e., expression that is at least two fold greater in infected 
versus normal tissue, as determined using a representative assay provided herein). Such 
screens may be performed, for example, using a Synteni microarray (Palo Alto, CA) 
according to the manufacturer's instructions (and essentially as described by Schena et 
al„ Proc. Natl. Acad. Sci. USA 93:10614-10619, 1996 and Heller et al, Proc. Natl. 
Acad. Sci. USA P4:2150-2155, 1997). Alternatively, polynucleotides may be amplified 
from cDNA prepared from cells expressing the proteins described herein. Such 
polynucleotides may be amplified via polymerase chain reaction (PCR). For this 
approach, sequence-specific primers may be designed based on the sequences provided 
herein, and may be purchased or synthesized. 

An amplified portion of a polynucleotide of the present invention may be 
used to isolate a full length gene from a suitable library (e.g., an HSV cDNA library) 
using well known techniques. Within such techniques, a library (cDNA or genomic) is 
screened using one or more polynucleotide probes or primers suitable for amplification. 
Preferably, a library is size-selected to include larger molecules. Random primed 
libraries may also be preferred for identifying 5 1 and upstream regions of genes. 
Genomic libraries are preferred for obtaining introns and extending 5' sequences. 

For hybridization techniques, a partial sequence may be labeled (e.g., by 
nick-translation or end-labeling with 32 P) using well known techniques. A bacterial or 
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bacteriophage library is then generally screened by hybridizing filters containing 
denatured bacterial colonies (or lawns containing phage plaques) with the labeled probe 
(see Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratories, Cold Spring Harbor, NY, 1989). Hybridizing colonies or plaques are 
selected and expanded, and the DNA is isolated for further analysis. cDNA clones may 
be analyzed to determine the amount of additional sequence by, for example, PCR using 
a primer from the partial sequence and a primer from the vector. Restriction maps and 
partial sequences may be generated to identify one or more overlapping clones. The 
complete sequence may then be determined using standard techniques, which may 
involve generating a series of deletion clones. The resulting overlapping sequences can 
then assembled into a single contiguous sequence. A full length cDNA molecule can be 
generated by ligating suitable fragments, using well known techniques. 

Alternatively, there are numerous amplification techniques for obtaining 
a full length coding sequence from a partial cDNA sequence. Within such techniques, 
amplification is generally performed via PCR. Any of a variety of commercially 
available kits may be used to perform the amplification step. Primers may be designed 
using, for example, software well known in the art. Primers are preferably 22*30 
nucleotides in length, have a GC content of at least 50% and anneal to the target 
sequence at temperatures of about 68°C to 72°C. The amplified region may be 
sequenced as described above, and overlapping sequences assembled into a contiguous 
sequence. 

One such amplification technique is inverse PCR (see Triglia et al., NucL 
Acids Res. 16:$l&6, 1988), which uses restriction enzymes to generate a fragment in the 
known region of the gene. The fragment is then circularized by intramolecular ligation 
and used as a template for PCR with divergent primers derived from the known region. 
Within an alternative approach, sequences adjacent to a partial sequence may be 
retrieved by amplification with a primer to a linker sequence and a primer specific to a 
known region. The amplified sequences are typically subjected to a second round of 
amplification with the same linker primer and a second primer specific to the known 
region. A variation on this procedure,, which employs two primers that initiate 
extension in opposite directions from the known sequence, is described in WO 
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96/38591. Another such technique is known as "rapid amplification of cDNA ends" or 
RACE. This technique involves the use of an internal primer and an external primer, 
which hybridizes to a polyA region or vector sequence, to identify sequences that are 5 ! 
and 3 1 of a known sequence. Additional techniques include capture PCR (Lagerstrom et 
al, PCR Methods Applic. 7:1 11-19, 1991) and walking PCR (Parker et al., Nucl Acids. 
Res. 7P:3055-60, 1991). Other methods employing amplification may also be employed 
to obtain a full length cDNA sequence. 

In certain instances, it is possible to obtain a full length cDNA sequence 
by analysis of sequences provided in an expressed sequence tag (EST) database, such as 
that available from GenBank. Searches for overlapping ESTs may generally be 
performed using well known programs (e.g., NCBI BLAST searches), and such ESTs 
may be used to generate a contiguous full length sequence. Full length DNA sequences 
may also be obtained by analysis of genomic fragments. 

Polynucleotide Expression in Host Cells 

In other embodiments of the invention, polynucleotide sequences or 
fragments thereof which encode polypeptides of the invention, or fusion proteins or 
functional equivalents thereof, may be used in recombinant DNA molecules to direct 
expression of a polypeptide in appropriate host cells. Due to the inherent degeneracy of 
the genetic code, other DNA sequences that encode substantially the same or a 
functionally equivalent amino acid sequence may be produced and these sequences may 
be used to clone and express a given polypeptide. 

As will be understood by those of skill in the art, it may be advantageous in 
some instances to produce polypeptide-encoding nucleotide sequences possessing non- 
naturally occurring codons. For example, codons preferred by a particular prokaryotic or 
eukatyotic host can be selected to increase the rate of protein expression or to produce a 
recombinant RNA transcript having desirable properties, such as a half-life which is 
longer than that of a transcript generated from the naturally occurring sequence. 

Moreover, the polynucleotide sequences of the present invention can be 
engineered using methods generally known in the art in order to alter polypeptide 
encoding sequences for a variety of reasons, including but not limited to, alterations 
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which modify the cloning, processing, and/or expression of the gene product. For 
example, DNA shuffling by random fragmentation and PCR reassembly of gene 
fragments and synthetic oligonucleotides may be used to engineer the nucleotide 
sequences. In addition, site-directed mutagenesis may be used to insert new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, or 
introduce mutations, and so forth. 

In another embodiment of the invention, natural, modified, or 
recombinant nucleic acid sequences may be ligated to a heterologous sequence to 
encode a fusion protein. For example, to screen peptide libraries for inhibitors of 
polypeptide activity, it may be useful to encode a chimeric protein that can be 
recognized by a commercially available antibody. A fusion protein may also be 
engineered to contain a cleavage site located between the polypeptide-encoding 
sequence and the heterologous protein sequence, so that the polypeptide may be cleaved 
and purified away from the heterologous moiety. 

Sequences encoding a desired polypeptide may be synthesized, in whole 
or in part, using chemical methods well known in the art (see Caruthers, M. H. et al. 
(1980) Nucl Acids Res. Symp. Ser. 215-223, Horn, T. et al. (1980) Nucl Acids Res. 
Symp. Ser. 225-232). Alternatively, the protein itself may be produced using chemical 
methods to synthesize the amino acid sequence of a polypeptide, or a portion thereof. 
For example, peptide synthesis can be performed using various solid-phase techniques 
(Roberge, J. Y. et al. (1995) Science 2<5P:202-204) and automated synthesis may be 
achieved, for example, using the ABI 431 A Peptide Synthesizer (Perkin Elmer, Palo 
Alto, CA). 

A newly synthesized peptide may be substantially purified by preparative 
high performance liquid chromatography (e.g., Creighton, T. (1983) Proteins, Structures 
and Molecular Principles, WH Freeman and Co., New York, N.Y.) or other comparable 
techniques available in the art. The composition of the synthetic peptides may be 
confirmed by amino acid analysis or sequencing (e.g., the Edman degradation 
procedure). Additionally, the amino acid sequence of a polypeptide, or any part thereof, 
may be altered during direct synthesis and/or combined using chemical methods with 
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sequences from other proteins, or any part thereof, to produce a variant polypeptide. 

In order to express a desired polypeptide, the nucleotide sequences 
encoding the polypeptide, or functional equivalents, may be inserted into appropriate 
expression vector, i.e., a vector which contains the necessary elements for the 
transcription and translation of the inserted coding sequence. Methods which are well 
known to those skilled in the art may be used to construct expression vectors containing 
sequences encoding a polypeptide of interest and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant DNA 
techniques, synthetic techniques, and in vivo genetic recombination. Such techniques 
are described in Sambrook, J. et al. (1989) Molecular Cloning, A Laboratory Manual, 
Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et al. (1989) Current 
Protocols in Molecular Biology, John Wiley & Sons, New York. N.Y. 

A variety of expression vector/host systems may be utilized to contain 
and express polynucleotide sequences. These include, but are not limited to, 
microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, 
or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; 
insect cell systems infected with virus expression vectors (e.g., baculovirus); plant cell 
systems transformed with virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or 
pBR322 plasmids); or animal cell systems. 

The "control elements" or "regulatory sequences" present in an 
expression vector are those non-translated regions of the vector— enhancers, promoters, 
5 f and 3' untranslated regions-which interact with host cellular proteins to carry out 
transcription and translation. Such elements may vary in their strength and specificity. 
Depending on the vector system and host utilized, any number of suitable transcription 
and translation elements, including constitutive and inducible promoters, may be used. 
For example, when cloning in bacterial systems, inducible promoters such as the hybrid 
lacZ promoter of the PBLUESCRIPT phagemid (Stratagene, La Jolla, Calif.) or 
PSPORT1 plasmid (Gibco BRL, Gaithersburg, MD) and the like may be used. In 
mammalian cell systems, promoters from mammalian genes or from mammalian 
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viruses are generally preferred. If it is necessary to generate a cell line that contains 
multiple copies of the sequence encoding a polypeptide, vectors based on SV40 or EBV 
may be advantageously used with an appropriate selectable marker. 

In bacterial systems, a number of expression vectors may be selected 
depending upon the use intended for the expressed polypeptide. For example, when 
large quantities are needed, for example for the induction of antibodies, vectors which 
direct high level expression of fusion proteins that are readily purified may be used. 
Such vectors include, but are not limited to, the multifunctional E. coli cloning and 
expression vectors such as BLUESCRIPT (Stratagene), in which the sequence encoding 
the polypeptide of interest may be ligated into the vector in frame with sequences for 
the amino-terminal Met and the subsequent 7 residues of .beta.-galactosidase so that a 
hybrid protein is produced; pIN vectors (Van Heeke, G. and S. M. Schuster (1989) J. 
Biol Chem. 264:5503-5509); and the like. pGEX Vectors (Promega, Madison, Wis.) 
may also be used to express foreign polypeptides as fusion proteins with glutathione S- 
transferase (GST). In general, such fusion proteins are soluble and can easily be purified 
from lysed cells by adsorption to glutathione-agarose beads followed by elution in the 
presence of free glutathione. Proteins made in such systems may be designed to include 
heparin, thrombin, or factor XA protease cleavage sites so that the cloned polypeptide 
of interest can be released from the GST moiety at will. 

In the yeast, Saccharomyces cerevisiae, a number of vectors containing 
constitutive or inducible promoters such as alpha factor, alcohol oxidase, and PGH may 
be used. For reviews, see Ausubel et al. (supra) and Grant et al. (1987) Methods 
EnzytnoL 153:516-544. 

In cases where plant expression vectors are used, the expression of 
sequences encoding polypeptides may be driven by any of a number of promoters. For 
example, viral promoters such as the 35S and 19S promoters of CaMV may be used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. 
(1987) EMBO J. (5:307-31 1. Alternatively, plant promoters such as the small subunit of 
RUBISCO or heat shock promoters may be used (Coruzzi, G. et al. (1984) EMBO J. 
5:1671-1680; Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) 
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Results Probl Cell Differ. 77:85-105). These constructs can be introduced into plant 
cells by direct DNA transformation or pathogen-mediated transfection. Such techniques 
are described in a number of generally available reviews (see, for example, Hobbs, S. or 
Murry, L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw 
Hill, New York, N.Y.; pp. 191-196). 

An insect system may also be used to express a polypeptide of interest. 
For example, in one such system, Autographa californica nuclear polyhedrosis virus 
(AcNPV) is used as a vector to express foreign genes in Spodoptera frugiperda cells or 
in Trichoplusia larvae. The sequences encoding the polypeptide may be cloned into a 
non-essential region of the virus, such as the polyhedrin gene, and placed under control 
of the polyhedrin promoter. Successful insertion of the polypeptide-encoding sequence 
will render the polyhedrin gene inactive and produce recombinant virus lacking coat 
protein. The recombinant viruses may then be used to infect, for example, S. frugiperda 
cells or Trichoplusia larvae in which the polypeptide of interest may be expressed 
(Engelhard, E. K. et al. (1994) Proc. Natl. Acad Sci. 91 :3224-3227). 

In mammalian host cells, a number of viral-based expression systems are 
generally available. For example, in cases where an adenovirus is used as an expression 
vector, sequences encoding a polypeptide of interest may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used 
to obtain a viable virus which is capable of expressing the polypeptide in infected host 
cells (Logan, J. and Shenk, T. (1984) Proc. Natl Acad. Sci. 57:3655-3659). In addition, 
transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used 
to increase expression in mammalian host cells. 

Specific initiation signals may also be used to achieve more efficient 
translation of sequences encoding a polypeptide of interest. Such signals include the 
ATG initiation codon and adjacent sequences. In cases where sequences encoding the 
polypeptide, its initiation codon, and upstream sequences are inserted into the 
appropriate expression vector, no additional transcriptional or translational control 
signals may be needed. However, in cases where only coding sequence, or a portion 
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thereof, is inserted, exogenous translational control signals including the ATG initiation 
codon should be provided. Furthermore, the initiation codon should be in the correct 
reading frame to ensure translation of the entire insert. Exogenous translational 
elements and initiation codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of enhancers which are 
appropriate for the particular cell system which is used, such as those described in the 
literature (Scharf, D. et al. (1994) Results Probl Cell Differ. 20:125-162). 

In addition, a host cell strain may be chosen for its ability to modulate 
the expression of the inserted sequences or to process the expressed protein in the 
desired fashion. Such modifications of the polypeptide include, but are not limited to, 
acetylation, carboxylation. glycosylation, phosphorylation, lipidation, and acylation. 
Post-translational processing which cleaves a "prepro" form of the protein may also be 
used to facilitate correct insertion, folding and/or function. Different host cells such as 
CHO, HeLa, MDCK, HEK293, and WI38, which have specific cellular machinery and 
characteristic mechanisms for such post-translational activities, may be chosen to ensure 
the correct modification and processing of the foreign protein. 

For long-term, high-yield production of recombinant proteins, stable 
expression is generally preferred. For example, cell lines which stably express a 
polynucleotide of interest may be transformed using expression vectors which may 
contain viral origins of replication and/or endogenous expression elements and a 
selectable marker gene on the same or on a separate vector. Following the introduction 
of the vector, cells may be allowed to grow for 1-2 days in an enriched media before 
they are switched to selective media. The purpose of the selectable marker is to confer 
resistance to selection, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed 
cells may be proliferated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed 
cell lines. These include, but are not limited to, the herpes simplex virus thymidine 
kinase (Wigler, M. et al. (1977) Cell 77:223-32) and adenine phosphoribosyltransferase 
(Lowy, I. et al. (1990) Cell 22:817-23) genes which can be employed in tk.sup.- or 
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aprtsup.- cells, respectively. Also, antimetabolite, antibiotic or herbicide resistance can 
be used as the basis for selection; for example, dhfir which confers resistance to 
methotrexate (Wigler, M. et al. (1980) Proc. Natl Acad. Sci. 77:3567-70); npt, which 
confers resistance to the aminoglycosides, neomycin and G-418 (Colbere-Garapin, F. et 
al (1981) J. Mol Biol. 750:1-14); and als or pat, which confer resistance to 
chlorsulfuron and phosphinotricin acetyltransferase, respectively (Murry, supra). 
Additional selectable genes have been described, for example, trpB, which allows cells 
to utilize indole in place of tryptophan, or hisD, which allows cells to utilize histinol in 
place of histidine (Hartman, S. C. and R. C. Mulligan (1988) Proc. Natl Acad Sci. 
55:8047-51). Recently, the use of visible markers has gained popularity with such 
markers as anthocyanins, beta-glucuronidase and its substrate GUS, and luciferase and 
its substrate luciferin, being widely used not only to identify transformants, but also to 
quantify the amount of transient or stable protein expression attributable to a specific 
vector system (Rhodes, C. A. et al. (1995) Methods Mol Biol 55:121-131). 

Although the presence/absence of marker gene expression suggests that 
the gene of interest is also present, its presence and expression may need to be 
confirmed. For example, if the sequence encoding a polypeptide is inserted within a 
marker gene sequence, recombinant cells containing sequences can be identified by the 
absence of marker gene function. Alternatively, a marker gene can be placed in tandem 
with a polypeptide-encoding sequence under the control of a single promoter. 
Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

Alternatively, host cells which contain and express a desired 
polynucleotide sequence may be identified by a variety of procedures known to those of 
skill in the art. These procedures include, but are not limited to, DNA-DNA or DNA- 
RNA hybridizations and protein bioassay or immunoassay techniques which include 
membrane, solution, or chip based technologies for the detection and/or quantification 
of nucleic acid or protein. 

A variety of protocols for detecting and measuring the expression of 
polynucleotide-encoded products, using either polyclonal or monoclonal antibodies 
specific for the product are known in the art. Examples include enzyme-linked 



WO 02/02131 



PCTYUS01/20981 



27 

immunosorbent assay (ELISA), radioimmunoassay (RIA), and fluorescence activated 
cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on a given polypeptide may be 
preferred for some applications, but a competitive binding assay may also be employed. 
These and other assays are described, among other places, in Hampton, R. et aL (1990; 
Serological Methods, a Laboratory Manual, APS Press, St Paul. Minn.) and Maddox, 
D. E. et al. (1983; J. Exp. Med 755:1211-1216). 

A wide variety of labels and conjugation techniques are known by those 
skilled in the art and may be used in various nucleic acid and amino acid assays. Means 
for producing labeled hybridization or PCR probes for detecting sequences related to 
polynucleotides include oligolabeling, nick translation, end-labeling or PCR 
amplification using a labeled nucleotide. Alternatively, the sequences, or any portions 
thereof may be cloned into a vector for the production of an mRNA probe. Such vectors 
are known in the art, are commercially available, and may be used to synthesize RNA 
probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 
and labeled nucleotides. These procedures may be conducted using a variety of 
commercially available kits. Suitable reporter molecules or labels, which may be used 
include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents 
as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with a polynucleotide sequence of interest may be 
cultured under conditions suitable for the expression and recovery of the protein from 
cell culture. The- protein produced by a recombinant cell may be secreted or contained 
intracellularly depending on the sequence and/or the vector used. As will be understood 
by those of skill in the art, expression vectors containing polynucleotides of the 
invention may be designed to contain signal sequences which direct secretion of the 
encoded polypeptide through a prokaryotic or eukaryotic cell membrane. Other 
recombinant constructions may be used to join sequences encoding a polypeptide of 
interest to nucleotide sequence encoding a polypeptide domain which will facilitate 
purification of soluble proteins. Such purification facilitating domains include, but are 
not limited to, metal chelating peptides such as histidine-tryptophan modules that allow 
purification on immobilized metals, protein A domains that allow purification on 
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immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity 
purification system (Immuiiex Corp., Seattle, Wash.). The inclusion of cleavable linker 
sequences such as those specific for Factor XA or enterokinase (Invitrogen. San Diego, 
Calif.) between the purification domain and the encoded polypeptide may be used to 
facilitate purification. One such expression vector provides for expression of a fusion 
protein containing a polypeptide of interest and a nucleic acid encoding 6 histidine 
residues preceding a thioredoxin or an enterokinase cleavage site. The histidine residues 
facilitate purification on IMIAC (immobilized metal ion affinity chromatography) as 
described in Porath, J. et al. (1992, Prot. Exp. Purif 5:263-281) while the enterokinase 
cleavage site provides a means for purifying the desired polypeptide from the fusion 
protein. A discussion of vectors which contain fusion proteins is provided in Kroll, D. J. 
et al. (1993; DNA Cell Biol 72:441-453). 

In addition to recombinant production methods, polypeptides of the 
invention, and fragments thereof, may be produced by direct peptide synthesis using 
solid-phase techniques (Merrifield J. (1963) 7. Am. Chem. Soc. 55:2149-2154). Protein 
synthesis may be performed using manual techniques or by automation. Automated 
synthesis may be achieved, for example, using Applied Biosystems 431 A Peptide 
Synthesizer (Perkin Elmer). Alternatively, various fragments may be chemically 
synthesized separately and combined using chemical methods to produce the full length 
molecule. 

Site-Specific Mutagenesis 

Site-specific mutagenesis is a technique useful in the preparation of 
individual peptides, or biologically functional equivalent polypeptides, through specific 
mutagenesis of the underlying polynucleotides that encode them. The technique, 
well-known to those of skill in the art, further provides a ready ability to prepare and 
test sequence variants, for example, incorporating one or more of the foregoing 
considerations, by introducing one or more nucleotide sequence changes into the DNA. 
Site-specific mutagenesis allows the production of mutants through the use of specific 
oligonucleotide sequences which encode the DNA sequence of the desired mutation, as 
well as a sufficient number of adjacent nucleotides, to provide a primer sequence of 
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sufficient size and sequence complexity to form a stable duplex on both sides of the 
deletion junction being traversed. Mutations may be employed in a selected 
polynucleotide sequence to improve, alter, decrease, modify, or otherwise change the 
properties of the polynucleotide itself, and/or alter the properties, activity, composition, 
stability, or primary sequence of the encoded polypeptide. 

In certain embodiments of the present invention, the inventors 
contemplate the mutagenesis of the disclosed polynucleotide sequences to alter one or 
more properties of the encoded polypeptide, such as the antigenicity of a polypeptide 
vaccine. The techniques of site-specific mutagenesis are well-known in the art, and are 
widely used to create variants of both polypeptides and polynucleotides. For example, 
site-specific mutagenesis is often used to alter a specific portion of a DNA molecule. In 
such embodiments, a primer comprising typically about 14 to about 25 nucleotides or so 
in length is employed, with about 5 to about 10 residues on both sides of the junction of 
the sequence being altered. 

As will be appreciated by those of skill in the art, site-specific 
mutagenesis techniques have often employed a phage vector that exists in both a single 
stranded and double stranded form. Typical vectors useful in site-directed mutagenesis 
include vectors such as the Ml 3 phage. These phage are readily commercially-available 
and their use is generally well-known to those skilled in the art. Double-stranded 
plasmids are also routinely employed in site directed mutagenesis that eliminates the 
step of transferring the gene of interest from a plasmid to a phage. 

In general, site-directed mutagenesis in accordance herewith is 
performed by first obtaining a single-stranded vector or melting apart of two strands of a 
double-stranded vector that includes within its sequence a DNA sequence that encodes 
the desired peptide. An oligonucleotide primer bearing the desired mutated sequence is 
prepared, generally synthetically. This primer is then annealed with the single-stranded 
vector, and subjected to DNA polymerizing enzymes such as E. coli polymerase I 
Klenow fragment, in order to complete the synthesis of the mutation-bearing strand. 
Thus, a heteroduplex is formed wherein one strand encodes the original non-mutated 
sequence and the second strand bears the desired mutation. This heteroduplex vector is 
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then used to transform appropriate cells, such as E. coli cells, and clones are selected 
which include recombinant vectors bearing the mutated sequence arrangement. 

The preparation of sequence variants of the selected peptide-encoding 
DNA segments using site-directed mutagenesis provides a means of producing 
potentially useful species and is not meant to be limiting as there are other ways in 
which sequence variants of peptides and the DNA sequences encoding them may be 
obtained. For example, recombinant vectors encoding the desired peptide sequence 
may be treated with mutagenic agents, such as hydroxylamine, to obtain sequence 
variants. Specific details regarding these methods and protocols are found in the 
teachings of Maloy et al, 1994; Segal, 1976; Prokop and Bajpai, 1991; Kuby, 1994; and 
Maniatis et aL, 1982, each incorporated herein by reference, for that purpose. 

As used herein, the term "oligonucleotide directed mutagenesis 
procedure" refers to template-dependent processes and vector-mediated propagation 
which result in an increase in the concentration of a specific nucleic acid molecule 
relative to its initial concentration, or in an increase in the concentration of a detectable 
signal, such as amplification. As used herein, the term "oligonucleotide directed 
mutagenesis procedure" is intended to refer to a process that involves the 
template-dependent extension of a primer molecule. The term template dependent 
process refers to nucleic acid synthesis of an RNA or a DNA molecule wherein the 
sequence of the newly synthesized strand of nucleic acid is dictated by the well-known 
rules of complementary base pairing (see, for example, Watson, 1987). Typically, 
vector mediated methodologies involve the introduction of the nucleic acid fragment 
into a DNA or RNA vector, the clonal amplification of the vector, and the recovery of 
the amplified nucleic acid fragment. Examples of such methodologies are provided by 
U. S. Patent No. 4,237,224, specifically incorporated herein by reference in its entirety. 

Polynucleotide Amplification Techniques 

A number of template dependent processes are available to amplify the 
target sequences of interest present in a sample. One of the best known amplification 
methods is the polymerase chain reaction (PCR™) which is described in detail in U.S. 
Patent Nos. 4,683,195, 4,683,202 and 4,800,159, each of which is incorporated herein 
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by reference in its entirety. Briefly, in PCR™, two primer sequences are prepared 
which are complementary to regions on opposite complementary strands of the target 
sequence. An excess of deoxynucleoside triphosphates is added to a reaction mixture 
along with a DNA polymerase (e.g., Tag polymerase). If the target sequence is present 
in a sample, the primers will bind to the target and the polymerase will cause the 
primers to be extended along the target sequence by adding on nucleotides. By raising 
and lowering the temperature of the reaction mixture, the extended primers will 
dissociate from the target to form reaction products, excess primers will bind to the 
target and to the reaction product and the process is repeated. Preferably reverse 
transcription and PCR™ amplification procedure may be performed in order to quantify 
the amount of mRNA amplified. Polymerase chain reaction methodologies are well 
known in the art. 

Another method for amplification is the ligase chain reaction (referred to 
as LCR), disclosed in Eur. Pat. Appl. Publ. No. 320,308 (specifically incorporated 
herein by reference in its entirety). In LCR, two complementary probe pairs are 
prepared, and in the presence of the target sequence, each pair will bind to opposite 
complementary strands of the target such that they abut. In the presence of a ligase, the 
two probe pairs will link to form a single unit. By temperature cycling, as in PCR™, 
bound ligated units dissociate from the target and then serve as "target sequences" for 
ligation of excess probe pairs. U.S. Patent No. 4,883,750, incorporated herein by 
reference in its entirety, describes an alternative method of amplification similar to LCR 
for binding probe pairs to a target sequence. 

Qbeta Replicase, described in PCT Intl. Pat. Appl. Publ. No. 
PCT/US87/00880, incorporated herein by reference in its entirety, may also be used as 
still another amplification method in the present invention. In this method, a replicative 
sequence of RNA that has a region complementary to that of a target is added to a 
sample in the presence of an RNA polymerase. The polymerase will copy the 
replicative sequence that can then be detected. 

An isothermal amplification method, in which restriction endonucleases 
and ligases are used to achieve the amplification of target molecules that contain 
nucleotide 5'-[a-thio]triphosphates in one strand of a restriction site (Walker et al. 9 
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1992, incorporated herein by reference in its entirety), may also be useful in the 
amplification of nucleic acids in the present invention. 

Strand Displacement Amplification (SDA) is another method of carrying 
out isothermal amplification of nucleic acids which involves multiple rounds of strand 
displacement and synthesis, i.e. nick translation. A similar method, called Repair Chain 
Reaction (RCR) is another method of amplification which may be useful in the present 
invention and is involves annealing several probes throughout a region targeted for 
amplification, followed by a repair reaction in which only two of the four bases are 
present. The other two bases can be added as biotinylated derivatives for easy 
detection. A similar approach is used in SDA. 

Sequences can also be detected using a cyclic probe reaction (CPR). In 
CPR, a probe having a 3' and 5' sequences of non-target DNA and an internal or 
"middle" sequence of the target protein specific RNA is hybridized to DNA which is 
present in a sample. Upon hybridization, the reaction is treated with RNaseH, and the 
products of the probe are identified as distinctive products by generating a signal that is 
released after digestion. The original template is annealed to another cycling probe and 
the reaction is repeated. Thus, CPR involves amplifying a signal generated by 
hybridization of a probe to a target gene specific expressed nucleic acid. 

Still other amplification methods described in Great Britain Pat. Appl. 
No. 2 202 328, and in PCT Intl. Pat. Appl. Publ. No. PCT/US89/01025, each of which 
is incorporated herein by reference in its entirety, may be used in accordance with the 
present invention. In the former application, "modified" primers are used in a PCR-like, 
template and enzyme dependent synthesis. The primers may be modified by labeling 
with a capture moiety (e.g., biotin) and/or a detector moiety (e.g., enzyme). In the latter 
application, an excess of labeled probes is added to a sample. In the presence of the 
target sequence, the probe binds and is cleaved catalytically. After cleavage, the target 
sequence is released intact to be bound by excess probe. Cleavage of the labeled probe 
signals the presence of the target sequence. 

Other nucleic acid amplification procedures include transcription-based 
amplification systems (TAS) (Kwoh et al, 1989; PCT Intl. Pat. Appl. Publ. No. WO 
88/10315, incorporated herein by reference in its entirety), including nucleic acid 
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sequence based amplification (NASBA) and 3SR. In NASBA, the nucleic acids can be 
prepared for amplification by standard phenol/chloroform extraction, heat denaturation 
of a sample, treatment with lysis buffer and minispin columns for isolation of DNA and 
RNA or guanidinium chloride extraction of RNA. These amplification techniques 
involve annealing a primer that has sequences specific to the target sequence. 
Following polymerization, DNA/RNA hybrids are digested with RNase H while double 
stranded DNA molecules are heat-denatured again. In either case the single stranded 
DNA is made fully double stranded by addition of second target-specific primer, 
followed by polymerization. The double stranded DNA molecules are then multiply 
transcribed by a polymerase such as T7 or SP6. In an isothermal cyclic reaction, the 
RNAs are reverse transcribed into DNA, and transcribed once again with a polymerase 
such as T7 or SP6. The resulting products, whether truncated or complete, indicate 
target-specific sequences. 

Eur. Pat. Appl. Publ. No. 329,822, incorporated herein by reference in its 
entirety, disclose a nucleic acid amplification process involving cyclically synthesizing 
single-stranded RNA ("ssRNA"), ssDNA, and double-stranded DNA (dsDNA), which 
may be used in accordance with the present invention. The ssRNA is a first template 
for a first primer oligonucleotide, which is elongated by reverse transcriptase 
(RNA-dependent DNA polymerase). The RNA is then removed from resulting 
DNA:RNA duplex by the action of ribonuclease H (RNase H, an RNase specific for 
RNA in a duplex with either DNA or RNA). The resultant ssDNA is a second template 
for a second primer, which also includes the sequences of an RNA polymerase promoter 
(exemplified by T7 RNA polymerase) 5' to its homology to its template. This primer is 
then extended by DNA polymerase (exemplified by the large "Klenow" fragment of E 
coli DNA polymerase I), resulting as a double-stranded DNA ("dsDNA") molecule, 
having a sequence identical to that of the original RNA between the primers and having 
additionally, at one end, a promoter sequence. This promoter sequence can be used by 
the appropriate RNA polymerase to make many RNA copies of the DNA. These copies 
can then re-enter the cycle leading to very swift amplification. With proper choice of 
enzymes, this amplification can be done isothermally without addition of enzymes at 
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each cycle. Because of the cyclical nature of this process, the starting sequence can be 
chosen to be in the form of either DNA or RNA. 

PCT IntL Pat. Appl. Publ. No. WO 89/06700, incorporated herein by 
reference in its entirety, disclose a nucleic acid sequence amplification scheme based on 
the hybridization of a promoter/primer sequence to a target single-stranded DNA 
("ssDNA") followed by transcription of many RNA copies of the sequence. This 
scheme is not cyclic; i.e. new templates are not produced from the resultant RNA 
transcripts. Other amplification methods include "RACE" (Frohman, 1990), and 
"one-sided PCR" (Ohara, 1989) which are well-known to those of skill in the art. 

Methods based on ligation of two (or more) oligonucleotides in the 
presence of nucleic acid having the sequence of the resulting "di-oligonucleotide", 
thereby amplifying the di-oligonucleotide (Wu and Dean, 1996, incorporated herein by 
reference in its entirety), may also be used in the amplification of DNA sequences of the 
present invention. 

Biological Functional Equivalents 

Modification and changes may be made in the structure of the 
polynucleotides and polypeptides of the present invention and still obtain a functional 
molecule that encodes a polypeptide with desirable characteristics. As mentioned 
above, it is often desirable to introduce one or more mutations into a specific 
polynucleotide sequence. In certain circumstances, the resulting encoded polypeptide 
sequence is altered by this mutation, or in other cases, the sequence of the polypeptide is 
unchanged by one or more mutations in the encoding polynucleotide. 

When it is desirable to alter the amino acid sequence of a polypeptide to 
create an equivalent, or even an improved, second-generation molecule, the amino acid 
changes may be achieved by changing one or more of the codons of the encoding DNA 
sequence, according to Table 1. 

For example, certain amino acids may be substituted for other amino 
acids in a protein structure without appreciable loss of interactive binding capacity with 
structures such as, for example, antigen-binding regions of antibodies or binding sites 
on substrate molecules. Since it is the interactive capacity and nature of a protein that 
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defines that protein's biological functional activity, certain amino acid sequence 
substitutions can be made in a protein sequence, and, of course, its underlying DNA 
coding sequence, and nevertheless obtain a protein with like properties. It is thus 
contemplated by the inventors that various changes may be made in the peptide 
sequences of the disclosed compositions, or corresponding DNA sequences which 
encode said peptides without appreciable loss of their biological utility or activity. 

Table 1 



Amino Acids Codons 



Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 






Cysteine 


Cys 


c 


UGC 


UGU 










Aspartic acid 


Asd 

r 


D 


GAC 


GAU 










Glutamic acid 


Glu 


E 


GAA 


GAG 










Phenylalanine 


Phe 


F 


. UUC 


UUU 










Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 






Histidine 


His 


H 


CAC 


CAU 










Isoleucine 


He 


I 


AUA 


AUC 


AUU 








Lysine 


Lys 


K 


AAA 


AAG 










Leucine 


Leu 


L 


UUA 


UUG 


CUA 


cue 


CUG 


CUU 


Methionine 


Met 


M 


AUG 












Asparagine 


Asn 


N 


AAC 


AAU 










Proline 


Pro 


P 


CCA 


CCC 


CCG 


ecu 






Glutamine 


Gin 


Q 


CAA 


CAG 










Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


CGG 


CGU 


Serine 


Ser 


S 


AGC 


AGU 


UCA 


ucc 


UCG 


UCU 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 






Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 






Tryptophan 


Trp 


w 


UGG 












Tyrosine 


Tyr 


Y 


UAC 


UAU 











In making such changes, the hydropathic index of amino acids may be 
considered. The importance of the hydropathic amino acid index in conferring 
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interactive biologic function on a protein is generally understood in the art (Kyte and 
Doolittle, 1982, incorporated herein by reference). It is accepted that the relative 
hydropathic character of the amino acid contributes to the secondary structure of the 
resultant protein, which in turn defines the interaction of the protein with other 
molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and 
the like. Each amino acid has been assigned a hydropathic index on the basis of its 
hydrophobicity and charge characteristics (Kyte and Doolittle, 1982). These values are: 
isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine 
(+2.5); methionine (+1.9); alanine (+1.8); glycine (-0.4); threonine (-0.7); serine (-0.8); 
tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); 
glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

It is known in the art that certain amino acids may be substituted by other 
amino acids having a similar hydropathic index or score and still result in a protein with 
similar biological activity, i.e. still obtain a biological functionally equivalent protein. 
In making such changes, the substitution of amino acids whose hydropathic indices are 
within +2 is preferred, those within ±1 are particularly preferred, and those within ±0.5 
are even more particularly preferred. It is also understood in the art that the substitution 
of like amino acids can be made effectively on the basis of hydrophilicity. U. S. Patent 
4,554,101 (specifically incorporated herein by reference in its entirety), states that the 
greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of 
its adjacent amino acids, correlates with a biological property of the protein. 

As detailed in U. S. Patent 4,554,101 , the following hydrophilicity values 
have been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate 
(+3.0 ± 1); glutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); 
glycine (0); threonine (-0.4); proline (-0.5 + 1); alanine (-0.5); histidine (-0.5); cysteine 
(-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (- 
2.3); phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino acid can be 
substituted for another having a similar hydrophilicity value and still obtain a 
biologically equivalent, and in particular, an immunologically equivalent protein. In 
such changes, the substitution of amino acids whose hydrophilicity values are within +2 
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is preferred, those within ±1 are particularly preferred, and those within ±0.5 are even 
more particularly preferred. 

As outlined above, amino acid substitutions are generally therefore based 
on the relative similarity of the amino acid side-chain substituents, for example, their 
hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions that 
take various of the foregoing characteristics into consideration are well known to those 
of skill in the art and include: arginine and lysine; glutamate and aspartate; serine and 
threonine; glutamine and asparagine; and valine, leucine and isoleucine. 

In addition, any polynucleotide may be further modified to increase 
stability in vivo. Possible modifications include, but are not limited to, the addition of 
flanking sequences at the 5* and/or 3' ends; the use of phosphorothioate or 2 ! O-methyl 
rather than phosphodiesterase linkages in the backbone; and/or the inclusion of 
nontraditional bases such as inosine, queosine and wybutosine, as well as acetyl- 
methyl-, thio- and other modified forms of adenine, cytidine, guanine, thymine and 
uridine. 

In vrvo Polynucleotide Delivery Techniques 

In additional embodiments, genetic constructs comprising one or more of 
the polynucleotides of the invention are introduced into cells in vivo. This may be 
achieved using any of a variety or well known approaches, several of which are outlined 
below for the purpose of illustration. 
1. Adenovirus 

One of the preferred methods for in vivo delivery of one or more nucleic 
acid sequences involves the use of an adenovirus expression vector. "Adenovirus 
expression vector 1 ' is meant to include those constructs containing adenovirus sequences 
sufficient to (a) support packaging of the construct and (b) to express a polynucleotide 
that has been cloned therein in a sense or antisense orientation. Of course, in the 
context of an antisense construct, expression does not require that the gene product be 
synthesized. 

The expression vector comprises a genetically engineered form of an 
adenovirus. Knowledge of the genetic organization of adenovirus, a 36 kb, linear, 
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double-stranded DNA virus, allows substitution of large pieces of adenoviral DNA with 
foreign sequences up to 7 kb (Grunhaus and Horwitz, 1992). In contrast to retrovirus, 
the adenoviral infection of host cells does not result in chromosomal integration because 
adenoviral DNA can replicate in an episomal manner without potential genotoxicity. 
Also, adenoviruses are structurally stable, and no genome rearrangement has been 
detected after extensive amplification. Adenovirus can infect virtually all epithelial 
cells regardless of their cell cycle stage. So far, adenoviral infection appears to be 
linked only to mild disease such as acute respiratory disease in humans. 

Adenovirus is particularly suitable for use as a gene transfer vector 
because of its mid-sized genome, ease of manipulation, high titer, wide target-cell range 
and high infectivity. Both ends of the viral genome contain 100-200 base pair inverted 
repeats (ITRs), which are cis elements necessary for viral DNA replication and 
packaging. The early (E) and late (L) regions of the genome contain different 
transcription units that are divided by the onset of viral DNA replication. The El region 
(E1A and E1B) encodes proteins responsible for the regulation of transcription of the 
viral genome and a few cellular genes. The expression of the E2 region (E2A and E2B) 
results in the synthesis of the proteins for viral DNA replication. These proteins are 
involved in DNA replication, late gene expression and host cell shut-off (Renan, 1990). 
The products of the late genes, including the majority of the viral capsid proteins, are 
expressed only after significant processing of a single primary transcript issued by the 
major late promoter (MLP). The MLP, (located at 16.8 m.u.) is particularly efficient 
during the late phase of infection, and all the mRNA's issued from this promoter possess 
a 5'-tripartite leader (TPL) sequence which makes them preferred mRNA's for 
translation. 

In a current system, recombinant adenovirus is generated from 
homologous recombination between shuttle vector and provirus vector. Due to the 
possible recombination between two proviral vectors, wild-type adenovirus may be 
generated from this process. Therefore, it is critical to isolate a single clone of virus 
from an individual plaque and examine its genomic structure. 

Generation and propagation of the current adenovirus vectors, which are 
replication deficient, depend on a unique helper cell line, designated 293, which was 
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transformed from human embryonic kidney cells by Ad5 DNA fragments and 
constitutively expresses El proteins (Graham et aL, 1977). Since the E3 region is 
dispensable from the adenovirus genome (Jones and Shenk, 1978), the current 
adenovirus vectors, with the help of 293 cells, carry foreign DNA in either the El, the 
D3 or both regions (Graham and Prevec, 1991). In nature, adenovirus can package 
approximately 105% of the wild-type genome (Ghosh-Choudhury et al 9 1987), 
providing capacity for about 2 extra kB of DNA. Combined with the approximately 5.5 
kB of DNA that is replaceable in the El and E3 regions, the maximum capacity of the 
current adenovirus vector is under 7.5 kB, or about 15% of the total length of the vector. 
More than 80% of the adenovirus viral genome remains in the vector backbone and is 
the source of vector-borne cytotoxicity. Also, the replication deficiency of the El- 
deleted virus is incomplete. For example, leakage of viral gene expression has been 
observed with the currently available vectors at high multiplicities of infection (MOI) 
(Mulligan, 1993). 

Helper cell lines may be derived from human cells such as human 
embryonic kidney cells, muscle cells, hematopoietic cells or other human embryonic 
mesenchymal or epithelial cells. Alternatively, the helper cells may be derived from the 
cells of other mammalian species that are permissive for human adenovirus. Such cells 
include, e.g., Vero cells or other monkey embryonic mesenchymal or epithelial cells. 
As stated above, the currently preferred helper cell line is 293. 

Recently, Racher et al (1995) disclosed improved methods for culturing 
293 cells and propagating adenovirus. In one format, natural cell aggregates are grown 
by inoculating individual cells into 1 liter siliconized spinner flasks (Techne, 
Cambridge, UK) containing 100-200 ml of medium. Following stirring at 40 rpm, the 
cell viability is estimated with trypan blue. In another format, Fibra-Cel microcarriers 
(Bibby Sterlin, Stone, UK) (5 g/1) is employed as follows. A cell inoculum, 
resuspended in 5 ml of medium, is added to the carrier (50 ml) in a 250 ml Erlenmeyer 
flask and left stationary, with occasional agitation, for 1 to 4 h. The medium is then 
replaced with 50 ml of fresh medium and shaking initiated. For virus production, cells 
are allowed to grow to about 80% confluence, after which time the medium is replaced 
(to 25% of the final volume) and adenovirus added at an MOI of 0.05. Cultures are left 
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stationary overnight, following which the volume is increased to 100% and shaking 
commenced for another 72 h. 

Other than the requirement that the adenovirus vector be replication 
defective, or at least conditionally defective, the nature of the adenovirus vector is not 
believed to be crucial to the successful practice of the invention. The adenovirus may 
be of any of the 42 different known serotypes or subgroups A-F. Adenovirus type 5 of 
subgroup C is the preferred starting material in order to obtain a conditional replication- 
defective adenovirus vector for use in the present invention, since Adenovirus type 5 is 
a human adenovirus about which a great deal of biochemical and genetic information is 
known, and it has historically been used for most constructions employing adenovirus 
as a vector. 

As stated above, the typical vector according to the present invention is 
replication defective and will not have an adenovirus El region. Thus, it will be most 
convenient to introduce the polynucleotide encoding the gene of interest at the position 
from which the El -coding sequences have been removed. However, the position of 
insertion of the construct within the adenovirus sequences is not critical to the 
invention. The polynucleotide encoding the gene of interest may also be inserted in lieu 
of the deleted E3 region in E3 replacement vectors as described by Karlsson et al 
(1986) or in the E4 region where a helper cell line or helper virus complements the E4 
defect. 

Adenovirus is easy to grow and manipulate and exhibits broad host range 
in vitro and in vivo. This group of viruses can be obtained in high titers, e.g., 10 9 -10 n 
plaque-forming units per ml, and they are highly infective. The life cycle of adenovirus 
does not require integration into the host cell genome. The foreign genes delivered by 
adenovirus vectors are episomal and, therefore, have low genotoxicity to host cells. No 
side effects have been reported in studies of vaccination with wild-type adenovirus 
(Couch et ai, 1963; Top et al, 1971), demonstrating their safety and therapeutic 
potential as in vivo gene transfer vectors. 

Adenovirus vectors have been used in eukaryotic gene expression 
(Levrero et aL, 1991; Gomez-Foix et al, 1992) and vaccine development (Grunhaus 
and Horwitz, 1992; Graham and Prevec, 1992). Recently, animal studies suggested that 
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recombinant adenovirus could be used for gene therapy (Stratford-Perricaudet and 
Perricaudet, 1991; Stratford-Perricaudet et al, 1990; Rich et al, 1993). Studies in 
administering recombinant adenovirus to different tissues include trachea instillation 
(Rosenfeld et al, 1991; Rosenfeld et al, 1992), muscle injection (Ragot et al, 1993), 
peripheral intravenous injections (Herz and Gerard, 1993) and stereotactic inoculation 
into the brain (Le Gal La Salle et al , 1 993). 
2. Retroviruses 

The retroviruses are a group of single-stranded RNA viruses 
characterized by an ability to convert their RNA to double-stranded DNA in infected 
cells by a process of reverse-transcription (Coffin, 1990). The resulting DNA then 
stably integrates into cellular chromosomes as a provirus and directs synthesis of viral 
proteins. The integration results in the retention of the viral gene sequences in the 
recipient cell and its descendants. The retroviral genome contains three genes, gag, pol, 
and env that code for capsid proteins, polymerase enzyme, and envelope components, 
respectively. A sequence found upstream from the gag gene contains a signal for 
packaging of the genome into virions. Two long terminal repeat (LTR) sequences are 
present at the 5' and 3' ends of the viral genome. These contain strong promoter and 
enhancer sequences and are also required for integration in the host cell genome 
(Coffin, 1990). 

In order to construct a retroviral vector, a nucleic acid encoding one or 
more oligonucleotide or polynucleotide sequences of interest is inserted into the viral 
genome in the place of certain viral sequences to produce a virus that is replication- 
defective. In order to produce virions, a packaging cell line containing the gag, pol, and 
env genes but without the LTR and packaging components is constructed (Mann et al, 
1983). When a recombinant plasmid containing a cDNA, together with the retroviral 
LTR and packaging sequences is introduced into this cell line (by calcium phosphate 
precipitation for example), the packaging sequence allows the RNA transcript of the 
recombinant plasmid to be packaged into viral particles, which are then secreted into the 
culture media (Nicolas and Rubenstein, 1988; Temin, 1986; Mann et al, 1983). The 
media containing the recombinant retroviruses is then collected, optionally 
concentrated, and used for gene transfer. Retroviral vectors are able to infect a broad 
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variety of cell types. However, integration and stable expression require the division of 
host cells (Paskind et al, 1975). 

A novel approach designed to allow specific targeting of retrovirus 
vectors was recently developed based on the chemical modification of a retrovirus by 
the chemical addition of lactose residues to the viral envelope. This modification could 
permit the specific infection of hepatocytes via sialoglycoprotein receptors. 

A different approach to targeting of recombinant retroviruses was 
designed in which biotinylated antibodies against a retroviral envelope protein and 
against a specific cell receptor were used. The antibodies were coupled via the biotin 
components by using streptavidin (Roux et al, 1989). Using antibodies against major 
histocompatibility complex class I and class II antigens, they demonstrated the infection 
of a variety of human cells that bore those surface antigens with an ecotropic virus in 
vitro (Roux et al, 1989). 

3 . Adeno-Associated Viruses 

AAV (Ridgeway, 1988; Hermonat and Muzycska, 1984) is a parovirus, 
discovered as a contamination of adenoviral stocks. It is a ubiquitous virus (antibodies 
are present in 85% of the US human population) that has not been linked to any disease. 
It is also classified as a dependovirus, because its replications is dependent on the 
presence of a helper virus, such as adenovirus. Five serotypes have been isolated, of 
which AAV-2 is the best characterized. AAV has a single-stranded linear DNA that is 
encapsidated into capsid proteins VP1, VP2 and VP3 to form an icosahedral virion of 
20 to 24 nm in diameter (Muzyczka and McLaughlin, 1988). 

The AAV DNA is approximately 4.7 kilobases long. It contains two 
open reading frames and is flanked by two ITRs (FIG. 2). There are two major genes in 
the AAV genome: rep and cap. The rep gene codes for proteins responsible for viral 
replications, whereas cap codes for capsid protein VP1-3. Each ITR forms a T-shaped 
hairpin structure. These terminal repeats are the only essential cis components of the 
AAV for chromosomal integration. Therefore, the AAV can be used as a vector with all 
viral coding sequences removed and replaced by the cassette of genes for delivery. 
Three viral promoters have been identified and named p5, pi 9, and p40, according to 
their map position. Transcription from p5 and pi 9 results in production of rep proteins, 
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and transcription from p40 produces the capsid proteins (Hermonat and Muzyczka, 
1984). 

There are several factors that prompted researchers to study the 
possibility of using rAAV as an expression vector One is that the requirements for 
delivering a gene to integrate into the host chromosome are surprisingly few. It is 
necessary to have the 145-bp ITRs, which are only 6% of the AAV genome. This 
leaves room in the vector to assemble a 4.5-kb DNA insertion. While this carrying 
capacity may prevent the AAV from delivering large genes, it is amply suited for 
delivering the antisense constructs of the present invention. 

AAV is also a good choice of delivery vehicles due to its safety. There is 
a relatively complicated rescue mechanism: not only wild type adenovirus but also 
AAV genes are required to mobilize rAAV. Likewise, AAV is not pathogenic and not 
associated with any disease. The removal of viral coding sequences minimizes immune 
reactions to viral gene expression, and therefore, rAAV does not evoke an inflammatory 
response. 

4. Other Viral Vectors as Expression Constructs 

Other viral vectors may be employed as expression constructs in the 
present invention for the delivery of oligonucleotide or polynucleotide sequences to a 
host cell. Vectors derived from viruses such as vaccinia virus (Ridgeway, 1988; Coupar 
et al> 1988), lentiviruses, polio viruses and herpes viruses may be employed. They 
offer several attractive features for various mammalian cells (Friedmann, 1989; 
Ridgeway, 1988; Coupar et ai 9 1988; Horwich et al 9 1990). 

With the recent recognition of defective hepatitis B viruses, new insight 
was gained into the structure-function relationship of different viral sequences. In vitro 
studies showed that the virus could retain the ability for helper-dependent packaging 
and reverse transcription despite the deletion of up to 80% of its genome (Horwich et 
al 9 1990). This suggested that large portions of the genome could be replaced with 
foreign genetic material. The hepatotropism and persistence (integration) were 
particularly attractive properties for liver-directed gene transfer. Chang et al (1991) 
introduced the chloramphenicol acetyltransferase (CAT) gene into duck hepatitis B 
virus genome in the place of the polymerase, surface, and pre-surface coding sequences. 
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It was cotransfected with wild-type virus into an avian hepatoma cell line. Culture 
media containing high titers of the recombinant virus were used to infect primary 
duckling hepatocytes. Stable CAT gene expression was detected for at least 24 days 
after transfection (Chang etal, 1991). 
5. NON-VIRAL VECTORS 

In order to effect expression of the oligonucleotide or polynucleotide 
sequences of the present invention, the expression construct must be delivered into a 
cell. This delivery may be accomplished in vitro, as in laboratory procedures for 
transforming cells lines, or in vivo or ex vivo, as in the treatment of certain disease 
states. As described above, one preferred mechanism for delivery is via viral infection 
where the expression construct is encapsulated in an infectious viral particle. 

Once the expression construct has been delivered into the cell the nucleic 
acid encoding the desired oligonucleotide or polynucleotide sequences may be 
positioned and expressed at different sites. In certain embodiments, the nucleic acid 
encoding the construct may be stably integrated into the genome of the cell. This 
integration may be in the specific location arid orientation via homologous 
recombination (gene replacement) or it may be integrated in a random, non-specific 
location (gene augmentation). In yet further embodiments, the nucleic acid may be 
stably maintained in the cell as a separate, episomal segment of DNA. Such nucleic 
acid segments or "episomes" encode sequences sufficient to permit maintenance and 
replication independent of or in synchronization with the host cell cycle. How the 
expression construct is delivered to a cell and where in the cell the nucleic acid remains 
is dependent on the type of expression construct employed. 

In certain embodiments of the invention, the expression construct 
comprising one or more oligonucleotide or polynucleotide sequences may simply 
consist of naked recombinant DNA or plasmids. Transfer of the construct may be 
performed by any of the methods mentioned above which physically or chemically 
permeabilize the cell membrane. This is particularly applicable for transfer in vitro but 
it may be applied to in vivo use as well. Dubensky et al (1984) successfully injected 
polyomavirus DNA in the form of calcium phosphate precipitates into liver and spleen 
of adult and newborn mice demonstrating active viral replication and acute infection. 



WO 02/02131 



45 



PCT/US01/20981 



Benvenisty and Reshef (1986) also demonstrated that direct intraperitoneal injection of 
calcium phosphate-precipitated plasmids results in expression of the transfected genes. 
It is envisioned that DNA encoding a gene of interest may also be transferred in a 
similar manner in vivo and express the gene product. 

Another embodiment of the invention for transferring a naked DNA 
expression construct into cells may involve particle bombardment. This method 
depends on the ability to accelerate DNA-coated microprojectiles to a high velocity 
allowing them to pierce cell membranes and enter cells without killing them (Klein et 
al, 1987). Several devices for accelerating small particles have been developed. One 
such device relies on a high voltage discharge to generate an electrical current, which in 
turn provides the motive force (Yang et aL, 1990). The microprojectiles used have 
consisted of biologically inert substances such as tungsten or gold beads. 

Selected organs including the liver, skin, and muscle tissue of rats and 
mice have been bombarded in vivo (Yang et al y 1990; Zelenin et al 9 1991). This may 
require surgical exposure of the tissue or cells, to eliminate any intervening tissue 
between the gun and the target organ, i.e. ex vivo treatment. Again, DNA encoding a 
particular gene may be delivered via this method and still be incorporated by the present 
invention. 

Antisense Oligonucleotides 

The end result of the flow of genetic information is the synthesis of 
protein. DNA is transcribed by polymerases into messenger RNA and translated on the 
ribosome to yield a folded, functional protein. Thus there are several steps along the 
route where protein synthesis can be inhibited. The native DNA segment coding for a 
polypeptide described herein, as all such mammalian DNA strands, has two strands: a 
sense strand and an antisense strand held together by hydrogen bonding. The messenger 
RNA coding for polypeptide has the same nucleotide sequence as the sense DNA strand 
except that the DNA thymidine is replaced by uridine. Thus, synthetic antisense 
nucleotide sequences will bind to a mRNA and inhibit expression of the protein 
encoded by that mRNA. 
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The targeting of antisense oligonucleotides to mRNA is thus one 
mechanism to shut down protein synthesis, and, consequently, represents a powerful 
and targeted therapeutic approach. For example, the synthesis of polygalactauronase 
and the muscarine type 2 acetylcholine receptor are inhibited by antisense 
oligonucleotides directed to their respective mRNA sequences (U. S. Patent 5,739,119 
and U. S. Patent 5,759,829, each specifically incorporated herein by reference in its 
entirety). Further, examples of antisense inhibition have been demonstrated with the 
nuclear protein cyclin, the multiple drug resistance gene (MDG1), ICAM-1, E-selectin, 
STK-1, striatal GABA A receptor and human EGF (Jaskulski et al, 1988; 
Vasanthakumar and Ahmed, 1989; Peris et al 9 1998; U. S. Patent 5,801,154; U. S. 
Patent 5,789,573; U. S. Patent 5,718,709 and U. S. Patent 5,610,288, each specifically 
incorporated herein by reference in its entirety). Antisense constructs have also been 
described that inhibit and can be used to treat a variety of abnormal cellular 
proliferations, e.g. cancer (U. S. Patent 5,747,470; U. S. Patent 5,591,317 and U. S. 
Patent 5,783,683, each specifically incorporated herein by reference in its entirety). 

Therefore, in exemplary embodiments, the invention provides 
oligonucleotide sequences that comprise all, or a portion of, any sequence that is 
capable of specifically binding to polynucleotide sequence described herein, or a 
complement thereof. In one embodiment, the antisense oligonucleotides comprise DNA 
or derivatives thereof. In another embodiment, the oligonucleotides comprise RNA or 
derivatives thereof. In a third embodiment, the oligonucleotides are modified DNAs 
comprising a phosphorothioated modified backbone. In a fourth embodiment, the 
oligonucleotide sequences comprise peptide nucleic acids or derivatives thereof. In 
each case, preferred compositions comprise a sequence region that is complementary, 
and more preferably substantially-complementary, and even more preferably, 
completely complementary to one or more portions of polynucleotides disclosed herein. 

Selection of antisense compositions specific for a given gene sequence is 
based upon analysis of the chosen target sequence (i.e. in these illustrative examples the 
rat and human sequences) and determination of secondary structure, T m , binding energy, 
relative stability, and antisense compositions were selected based upon their relative 
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inability to form dimers, hairpins, or other secondary structures that would reduce or 
prohibit specific binding to the target mRNA in a host cell. 

Highly preferred target regions of the mRNA, are those which are at or 
near the AUG translation initiation codon, and those sequences which were 
substantially complementary to 5' regions of the mRNA. These secondary structure 
analyses and target site selection considerations were performed using v.4 of the 
OLIGO primer analysis software (Rychlik, 1997) and the BLASTN 2.0.5 algorithm 
software (Altschul et al, 1997). 

The use of an antisense delivery method employing a short peptide 
vector, termed MPG (27 residues), is also contemplated. The MPG peptide contains a 
hydrophobic domain derived from the fusion sequence of HIV gp41 and a hydrophilic 
domain from the nuclear localization sequence of SV40 T-antigen (Morris et aL 9 1997). 
It has been demonstrated that several molecules of the MPG peptide coat the antisense 
oligonucleotides and can be delivered into cultured mammalian cells in less than 1 hour 
with relatively high efficiency (90%). Further, the interaction with MPG strongly 
increases both the stability of the oligonucleotide to nuclease and the ability to cross the 
plasma membrane (Morris et al. 9 1997). 

RIBOZYMES 

Although proteins traditionally have been used for catalysis of nucleic 
acids, another class of macromolecules has emerged as useful in this endeavor. 
Ribozymes are RNA-protein complexes that cleave nucleic acids in a site-specific 
fashion. Ribozymes have specific catalytic domains that possess endonuclease activity 
(Kim and Cech, 1987; Gerlach et al, 1987; Forster and Symons, 1987). For example, a 
large number of ribozymes accelerate phosphoester transfer reactions with a high degree 
of specificity, often cleaving only one of several phosphoesters in an oligonucleotide 
substrate (Cech et al, 1981; Michel and Westhof, 1990; Reinhold-Hurek and Shub, 
1992). This specificity has been attributed to the requirement that the substrate bind via 
specific base-pairing interactions to the internal guide sequence ("IGS") of the ribozyme 
prior to chemical reaction. 
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Ribozyme catalysis has primarily been observed as part of sequence- 
specific cleavage/ligation reactions involving nucleic acids (Joyce, 1989; Cech et al, 
1981). For example, U. S. Patent No. 5,354,855 (specifically incorporated herein by 
reference) reports that certain ribo2ymes can act as endonucleases with a sequence 
specificity greater than that of known ribonucleases and approaching that of the DNA 
restriction enzymes. Thus, sequence-specific ribozyme-mediated inhibition of gene 
expression may be particularly suited to therapeutic applications (Scanlon et al, 1991; 
Sarver et aL, 1990). Recently, it was reported that ribozymes elicited genetic changes in 
some cells lines to which they were applied; the altered genes included the oncogenes 
H-ray, c-fos and genes of HTV. Most of this work involved the modification of a target 
mRNA, based on a specific mutant codon that is cleaved by a specific ribozyme. 

Six basic varieties of naturally-occurring enzymatic RNAs are known 
presently. Each can catalyze the hydrolysis of RNA phosphodiester bonds in tram (and 
thus can cleave other RNA molecules) under physiological conditions. In general, 
enzymatic nucleic acids act by first binding to a target RNA. Such binding occurs 
through the target binding portion of a enzymatic nucleic acid which is held in close 
proximity to an enzymatic portion of the molecule that acts to cleave the target RNA. 
Thus, the enzymatic nucleic acid first recognizes and then binds a target RNA through 
complementary base-pairing, and once bound to the coirect site, acts enzymatically to 
cut the target RNA. Strategic cleavage of such a target RNA will destroy its ability to 
direct synthesis of an encoded protein. After an enzymatic nucleic acid has bound and 
cleaved its RNA target, it is released from that RNA to search for another target and can 
repeatedly bind and cleave new targets. 

The enzymatic nature of a ribozyme is advantageous over many 
technologies, such as antisense technology (where a nucleic acid molecule simply binds 
to a nucleic acid target to block its translation) since the concentration of ribozyme 
necessary to affect a therapeutic treatment is lower than that of an antisense 
oligonucleotide. This advantage reflects the ability of the ribozyme to act 
enzymatically. Thus, a single ribozyme molecule is able to cleave many molecules of 
target RNA. In addition, the ribozyme is a highly specific inhibitor, with the specificity 
of inhibition depending not only on the base pairing mechanism of binding to the target 
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RNA, but also on the mechanism of target RNA cleavage. Single mismatches, or base- 
substitutions, near the site of cleavage can completely eliminate catalytic activity of a 
ribozyme. Similar mismatches in antisense molecules do not prevent their action 
(Woolf et al , 1992). Thus, the specificity of action of a ribozyme is greater than that of 
an antisense oligonucleotide binding the same RNA site. 

The enzymatic nucleic acid molecule may be formed in a hammerhead, 
hairpin, a hepatitis 5 virus, group I intron or RNaseP RNA (in association with an RNA 
guide sequence) or Neurospora VS RNA motif. Examples of hammerhead motifs are 
described by Rossi et al (1992). Examples of hairpin motifs are described by Hampel 
etal (Eur. Pat. Appl. Publ. No. EP 0360257), Hampel and Tritz (1989), Hampel etal 
(1990) and U. S. Patent 5,631,359 (specifically incorporated herein by reference). An 
example of the hepatitis 5 virus motif is described by Perrotta and Been (1992); an 
example of the RNaseP motif is described by Guerrier-Takada et al (1983); Neurospora 
VS RNA ribozyme motif is described by Collins (Saville and Collins, 1990; Saville and 
Collins, 1991; Collins and Olive, 1993); and an example of the Group I intron is 
described in (U. S. Patent 4,987,071, specifically incorporated herein by reference). All 
that is important in an enzymatic nucleic acid molecule of this invention is that it has a 
specific substrate binding site which is complementary to one or more of the target gene 
RNA regions, and that it have nucleotide sequences within or surrounding that substrate 
binding site which impart an RNA cleaving activity to the molecule. Thus the ribozyme 
constructs need not be limited to specific motifs mentioned herein. 

In certain embodiments, it may be important to produce enzymatic 
cleaving agents which exhibit a high degree of specificity for the RNA of a desired 
target, such as one of the sequences disclosed herein. The enzymatic nucleic acid 
molecule is preferably targeted to a highly conserved sequence region of a target 
mRNA. Such enzyrfiatic nucleic acid molecules can be delivered exogenously to 
specific cells as required. Alternatively, the ribozymes can be expressed from DNA or 
RNA vectors that are delivered to specific cells. 

Small enzymatic nucleic acid motifs (e.g., of the hammerhead or the 
hairpin structure) may also be used for exogenous delivery. The simple structure of 
these molecules increases the ability of the enzymatic nucleic acid to invade targeted 
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regions of the mRNA structure. Alternatively, catalytic RNA molecules can be 
expressed within cells from eukaiyotic promoters (e.g., Scanlon etal, 1991; Kashani- 
Sabet etal, 1992; Dropulic etal, 1992; Weerasinghe etal., 1991; Ojwang etal, 1992; 
Chen et al , 1992; Sarver et al, 1990). Those skilled in the art realize that any ribozyme 
can be expressed in eukaryotic cells from the appropriate DNA vector. The activity of 
such ribozymes can be augmented by their release from the primary transcript by a 
second ribozyme (Int. Pat. Appl. Publ. No. WO 93/23569, and Int. Pat. Appl. Publ. No. 
WO 94/02595, both hereby incorporated by reference; Ohkawa et al, 1992; Taira et al, 
1991; and Venturas al, 1993). 

Ribozymes may be added directly, or can be complexed with cationic 
lipids, lipid complexes, packaged within liposomes, or otherwise delivered to target . 
cells. The RNA or RNA complexes can be locally administered to relevant tissues ex 
vivo, or in vivo through injection, aerosol inhalation, infusion pump or stent, with or 
without their incorporation in biopolymers. 

Ribozymes may be designed as described in Int. Pat. Appl. Publ. No. 
WO 93/23569 and Int. Pat. Appl. Publ. No. WO 94/02595, each specifically 
incorporated herein by reference) and synthesized to be tested in vitro and in vivo, as 
described. Such ribozymes can also be optimized for delivery. While specific 
examples are provided, those in the art will recognize that equivalent RNA targets in 
other species can be utilized when necessary. 

Hammerhead or hairpin ribozymes may be individually analyzed by 
computer folding (Jaeger etal, 1989) to assess whether the ribozyme sequences fold 
into the appropriate secondary structure. Those ribozymes with unfavorable 
intramolecular interactions between the binding arms and the catalytic core are 
eliminated from consideration. Varying binding arm lengths can be chosen to optimize 
activity. Generally, at least 5 or so bases on each arm are able to bind to, or otherwise 
interact with, the target RNA. 

Ribozymes of the hammerhead or hairpin motif may be designed to 
anneal to various sites in the mRNA message, and can be chemically synthesized. The 
method of synthesis used follows the procedure for normal RNA synthesis as described 
in Usman et al (1987) and in Scaringe et al (1990) and makes use of common nucleic 
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acid protecting and coupling groups, such as dimethoxytrityl at the 5'-end, and 
phosphoramidites at the 3'-end. Average stepwise coupling yields are typically >98%. 
Hairpin ribozymes may be synthesized in two parts and annealed to reconstruct an 
active ribozyme (Chowrira and Burke, 1992). Ribozymes may be modified extensively 
to enhance stability by modification with nuclease resistant groups, for example, 2'- 
amino, 2'-C-allyl, 2'-flouro, 2'-o-methyl, 2'-H (for a review see e.g., Usman and 
Cedergren, 1992). Ribozymes may be purified by gel electrophoresis using general 
methods or by high pressure liquid chromatography and resuspended in water. 

Ribozyme activity can be optimized by altering the length of the 
ribozyme binding, arms, or chemically synthesizing ribozymes with modifications that 
prevent their degradation by serum ribonucleases (see e.g., Int. Pat. Appl. PubL No. WO 
92/07065; Perrault'ef al, 1990; Pieken etal, 1991; Usman and Cedergren, 1992; Int. 
Pat. Appl. Publ. No. WO 93/15187; Int. Pat. Appl. Publ. No. WO 91/03162; Eur. Pat. 
Appl. Publ. No. 92110298.4; U. S. Patent 5,334,711; and Int. Pat. Appl. Publ. No. WO 
94/13688, which describe various chemical modifications that can be made to the sugar 
moieties of enzymatic RNA molecules), modifications which enhance their efficacy in 
cells, and removal of stem II bases to shorten RNA synthesis times and reduce chemical 
requirements. 

Sullivan et al (Int. Pat. Appl. Publ. No. WO 94/02595) describes the 
general methods for delivery of enzymatic RNA molecules. Ribozymes may be 
administered to cells by a variety of methods known to those familiar to the art, 
including, but not restricted to, encapsulation in liposomes, by iontophoresis, or by 
incorporation into other vehicles, such as hydrogels, cyclodextrins, biodegradable 
nanocapsules, and bioadhesive microspheres. For some indications, ribozymes may be 
directly delivered ex vivo to cells or tissues with or without the aforementioned vehicles. 
Alternatively, the RNA/vehicle combination may be locally delivered by direct 
inhalation, by direct injection or by use of a catheter, infusion pump or stent. Other 
routes of delivery include, but are not limited to, intravascular, intramuscular, 
subcutaneous or joint injection, aerosol inhalation, oral (tablet or pill form), topical, 
systemic, ocular, intraperitoneal and/or intrathecal delivery. More detailed descriptions 
of ribozyme delivery and administration are provided in Int. Pat. Appl. Publ. No. WO 
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94/02595 and Int. Pat. Appl. Publ. No. WO 93/23569, each specifically incorporated 
herein by reference. 

Another means of accumulating high concentrations of a ribozyme(s) 
within cells is to incorporate the ribozyme-encoding sequences into a DNA expression 
vector. Transcription of the ribozyme sequences are driven from a promoter for 
eukaryotic RNA polymerase I (pol I), RNA polymerase II (pol II), or RNA polymerase 
III (pol HI). Transcripts from pol II or pol III promoters will be expressed at high levels 
in all cells; the levels of a given pol II promoter in a given cell type will depend on the 
nature of the gene regulatory sequences (enhancers, silencers, etc.) present nearby. 
Prokaryotic RNA polymerase promoters may also be used, providing that the 
prokaryotic RNA polymerase enzyme is expressed in the appropriate cells (Elroy-Stein 
and Moss, 1990; Gao and Huang, 1993; Lieber etal, 1993; Zhou etal, 1990). 
Ribozymes expressed from such promoters can function in mammalian cells (e.g. 
Kashani-Saber etal, 1992; Ojwang etal, 1992; Chen etal, 1992; Yu etal, 1993; 
L'Huillier etal, 1992; Lisziewicz etal, 1993). Such transcription units can be 
incorporated into a variety of vectors for introduction into mammalian cells, including 
but not restricted to, plasmid DNA vectors, viral DNA vectors (such as adenovirus or 
adeno-associated vectors), or viral RNA vectors (such as retroviral, semliki forest virus, 
sindbis virus vectors). 

Ribozymes may be used as diagnostic tools to examine genetic drift and 
mutations within diseased cells. They can also be used to assess levels of the target 
RNA molecule. The close relationship between ribozyme activity and the structure of 
the target RNA allows the detection of mutations in any region of the molecule which 
alters the base-pairing and three-dimensional structure of the target RNA. By using 
multiple ribozymes, one may map nucleotide changes which are important to RNA 
structure and function in vitro, as well as in cells and tissues. Cleavage of target RNAs 
with ribozymes may be used to inhibit gene expression and define the role (essentially) 
of specified gene products in the progression of disease. In this manner, other genetic 
targets may be defined as important mediators of the disease. These studies will lead to 
better treatment of the disease progression by affording the possibility of combinational 
therapies (e.g., multiple ribozymes targeted to different genes, ribozymes coupled with 
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known small molecule inhibitors, or intermittent treatment with combinations of 
ribozymes and/or other chemical or biological molecules). Other in vitro uses of 
ribozymes are well known in the art, and include detection of the presence of mRNA 
associated with an IL-5 related condition. Such RNA is detected by determining the 
presence of a cleavage product after treatment with a ribozyme using standard 
methodology. 

Peptide Nucleic Acids 

In certain embodiments, the inventors contemplate the use of peptide 
nucleic acids (PNAs) in the practice of the methods of the invention. PNA is a DNA 
mimic in which the nucleobases are attached to a pseudopeptide backbone (Good and 
Nielsen, 1997). PNA is able to be utilized in a number methods that traditionally have 
used RNA or DNA. Often PNA sequences perform better in techniques than the 
corresponding RNA or DNA sequences and have utilities that are not inherent to RNA 
or DNA. A review of PNA including methods of making, characteristics of, and 
methods of using, is provided by Corey (1997) and is incorporated herein by reference. 
As such, in certain embodiments, one may prepare PNA sequences that are 
complementary to one or more portions of the ACE mRNA sequence, and such PNA 
compositions may be used to regulate, alter, decrease, or reduce the translation of ACE- 
specific mRNA, and thereby alter the level of ACE activity in a host cell to which such 
PNA compositions have been administered. 

PNAs have 2-aminoethyl-glycine linkages replacing the normal 
phosphodiester backbone of DNA (Nielsen et al , 1 99 1 ; Hanvey et ah , 1 992; Hyrup and 
Nielsen, 1996; Neilsen, 1996). This chemistry has three important consequences: 
firstly, in contrast to DNA or phosphorothioate oligonucleotides, PNAs are neutral 
molecules; secondly, PNAs are achiral, which avoids the need to develop a 
stereoselective synthesis; and thirdly, PNA synthesis uses standard Boc (Dueholm et al, 
1994) or Fmoc (Thomson et al, 1995) protocols for solid-phase peptide synthesis, 
although other methods, including a modified Merrifield method, have been used 
(Christensen et al, 1995). 
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PNA monomers or ready-made oligomers are commercially available 
from PerSeptive Biosystems (Framingham, MA). PNA syntheses by either Boc or 
Fmoc protocols are straightforward using manual or automated protocols (Norton et al, 
1995). The manual protocol lends itself to the production of chemically modified PNAs 
or the simultaneous synthesis of families of closely related PNAs. 

As with peptide synthesis, the success of a particular PNA synthesis will 
depend on the properties of the chosen sequence. For example, while in theory PNAs 
can incorporate any combination of nucleotide bases, the presence of adjacent purines 
can lead to deletions of one or more residues in the product. In expectation of this 
difficulty, it is suggested that, in producing PNAs with adjacent purines, one should 
repeat the coupling of residues likely to be added inefficiently. This should be followed 
by the purification of PNAs by reverse-phase high-pressure liquid chromatography 
(Norton et al., 1995) providing yields and purity of product similar to those observed 
during the synthesis of peptides. 

Modifications of PNAs for a given application may be accomplished by 
coupling amino acids during solid-phase synthesis or by attaching compounds that 
contain a carboxylic acid group to the exposed N-terminal amine. Alternatively, PNAs 
can be modified after synthesis by coupling to an introduced lysine or cysteine. The 
ease with which PNAs can be modified facilitates optimization for better solubility or 
for specific functional requirements. Once synthesized, the identity of PNAs and their 
derivatives can be confirmed by mass spectrometry. Several studies have made and 
utilized modifications of PNAs (Norton et al, 1995; Haaima et al, 1996; Stetsenko et 
al, 1996; Petersen et al, 1995; Ulmann et al, 1996; Koch et al, 1995; Orum et al, 
1995; Footer et al, 1996; Griffith et al, 1995; Kremsky et al, 1996; Pardridge et al, 
1995; Boffa et al, 1995; Landsdorp et al, 1996; Gambacorti-Passerini et al, 1996; 
Armitage et al, 1997; Seeger et al, 1997; Ruskowski et al, 1997). U.S. Patent No. 
5,700,922 discusses PNA-DNA-PNA chimeric molecules and their uses in diagnostics, 
modulating protein in organisms, and treatment of conditions susceptible to 
therapeutics. 

In contrast to DNA and RNA, which contain negatively charged 
linkages, the PNA backbone is neutral. In spite of this dramatic alteration, PNAs 
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recognize complementary DNA and KNA by Watson-Crick pairing (Egholm et al, 
1993), validating the initial modeling by Nielsen et al (1991). PNAs lack 3' to 5' 
polarity and can bind in either parallel or antiparallel fashion, with the antiparallel mode 
being preferred (Egholm et al , 1 993). 

Hybridization of DNA oligonucleotides to DNA and RNA is destabilized 
by electrostatic repulsion between the negatively charged phosphate backbones of the 
complementary strands. By contrast, the absence of charge repulsion in PNA-DNA or 
PNA-RNA duplexes increases the melting temperature (T m ) and reduces the dependence 
of T m on the concentration of mono- or divalent cations (Nielsen et al, 1991). The 
enhanced rate and affinity of hybridization are significant because they are responsible 
for the surprising ability of PNAs to perform strand invasion of complementary 
sequences within relaxed double-stranded DNA. In addition, the efficient hybridization 
at inverted repeats suggests that PNAs can recognize secondary structure effectively 
within double-stranded DNA. Enhanced recognition also occurs with PNAs 
immobilized on surfaces, and Wang et al have shown that support-bound PNAs can be 
used to detect hybridization events (Wang et al, 1996). 

One might expect that tight binding of PNAs to complementary 
sequences would also increase binding to similar (but not identical) sequences, reducing 
the sequence specificity of PNA recognition. As with DNA hybridization, however, 
selective recognition can be achieved by balancing oligomer length and incubation 
temperature. Moreover, selective hybridization of PNAs is encouraged by PNA-DNA 
hybridization being less tolerant of base mismatches than DNA-DNA hybridization. 
For example, a single mismatch within a 16 bp PNA-DNA duplex can reduce the T m by 
up to 15°C (Egholm et al, 1993). This high level of discrimination has allowed the 
development of several PNA-based strategies for the analysis of point mutations (Wang 
et al, 1996; Carlsson et al, 1996; Thiede et al, 1996; Webb and Hurskainen, 1996; 
Perry-O'Keefe etal, 1996). 

High-affinity binding provides clear advantages for molecular 
recognition and the development of new applications for PNAs. For example, 11-13 
nucleotide PNAs inhibit the activity of telomerase, a ribonucleo-protein that extends 
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telomere ends using an essential RNA template, while the analogous DNA oligomers do 
not (Norton et al, 1996). 

Neutral PNAs are more hydrophobic than analogous DNA oligomers, 
and this can lead to difficulty solubilizing them at neutral pH, especially if the PNAs 
have a high purine content or if they have the potential to form secondary structures. 
Their solubility can be enhanced by attaching one or more positive charges to the PNA 
termini (Nielsen et al, 1991). 

Findings by Allfrey and colleagues suggest that strand invasion will 
occur spontaneously at sequences within chromosomal DNA (Boffa et al, 1995; Boffa 
et al, 1996). These studies targeted PNAs to triplet repeats of the nucleotides CAG and 
used this recognition to purify transcriptionally active DNA (Boffa et al, 1995) and to 
inhibit transcription (Boffa et al, 1996). This result suggests that if PNAs can be 
delivered within cells then they will have the potential to be general sequence-specific 
regulators of gene expression. Studies and reviews concerning the use of PNAs as 
antisense and anti-gene agents include Nielsen et al. (1993b), Hanvey et al (1992), and 
Good and Nielsen (1997). Koppelhus et al (1997) have used PNAs to inhibit HIV-1 
inverse transcription, showing that PNAs may be used for antiviral therapies. 

Methods of characterizing the antisense binding properties of PNAs are 
discussed in Rose (1993) and Jensen et al (1997). Rose uses capillary gel 
electrophoresis to determine binding of PNAs to their complementary oligonucleotide, 
measuring the relative binding kinetics and stoichiometry. Similar types of 
measurements were made by Jensen et al using BIAcore™ technology. 

Other applications of PNAs include use in DNA strand invasion (Nielsen 
et al, 1991), antisense inhibition (Hanvey et al, 1992), mutational analysis (Orum et 
al, 1993), enhancers of transcription (Mollegaard et al, 1994), nucleic acid purification 
(Orum et al, 1995), isolation of transcriptionally active genes (Boffa et al, 1995), 
blocking of transcription factor binding (Vickers et al, 1995), genome cleavage 
(Veselkov et al, 1996), biosensors (Wang et al, 1996), in situ hybridization (Thisted et 
al, 1996), and in a alternative to Southern blotting (Perry-O'Keefe, 1996). 
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Polypeptide Compositions 

The present invention, in other aspects, provides polypeptide 
compositions. Generally, a polypeptide of the invention will be an isolated polypeptide 
(or an epitope, variant, or active fragment thereof) derived from HSV. Preferably, the 
polypeptide is encoded by a polynucleotide sequence disclosed herein or a sequence 
which hybridizes under moderate or highly stringent conditions to a polynucleotide 
sequence disclosed herein. Alternatively, the polypeptide may be defined as a 
polypeptide which comprises a contiguous amino acid sequence from an amino acid 
sequence disclosed herein, or which polypeptide comprises an entire amino acid 
sequence disclosed herein. 

In the present invention, a polypeptide composition is also understood to 
comprise one or more polypeptides that are immunologically reactive with antibodies 
and/or T cells generated against a polypeptide of the invention, particularly a 
polypeptide having amino acid sequences disclosed herein, or to active fragments, or to 
variants or biological functional equivalents thereof. 

Likewise, a polypeptide composition of the present invention is 
understood to comprise one or more polypeptides that are capable of eliciting antibodies 
or T cells that are immunologically reactive with one or more polypeptides encoded by 
one or more contiguous nucleic acid sequences contained in the amino acid sequences 
disclosed herein, or to active fragments, or to variants thereof, or to one or more nucleic 
acid sequences which hybridize to one or more of these sequences under conditions of 
moderate to high stringency. Particularly illustrative polypeptides comprise the amino 
acid sequence disclosed in SEQ ID NO: 2, 5, 6, 9, 1 0, 1 1, 14, 17, 20, 21 , 22 and 25. 

As used herein, an active fragment of a polypeptide includes a whole or a 
portion of a polypeptide which is modified by conventional techniques, e.g., 
mutagenesis, or by addition, deletion, or substitution, but which active fragment 
exhibits substantially the same structure function, antigenicity, etc., as a polypeptide as 
described herein. 

In certain illustrative embodiments, the polypeptides of the invention 
will comprise at least an immunogenic portion of an HSV antigen or a variant or 
biological functional equivalent thereof, as described herein. Polypeptides as described 
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herein may be of any length. Additional sequences derived from the native protein 
and/or heterologous sequences may be present, and such sequences may (but need not) 
possess further immunogenic or antigenic properties. 

An "immunogenic portion," as used herein is a portion of a protein that 
is recognized (/.e., specifically bound) by a B-cell -and/or T-cell surface antigen receptor. 
Such immunogenic portions generally comprise at least 5 amino acid residues, more 
preferably at least 10, and still more preferably at least 20 amino acid residues of an 
HSV protein or a variant thereof. Certain preferred immunogenic portions include 
peptides in which an N-terminal leader sequence and/or transmembrane domain have 
been deleted. Other preferred immunogenic portions may contain a small N- and/or C- 
tenninal deletion (e.g., 1-30 amino acids, preferably 5-15 amino acids), relative to the 
mature protein. 

Immunogenic portions may generally be identified using well known 
techniques, such as those summarized in Paul, Fundamental Immunology, 3rd ed., 243- 
247 (Raven Press, 1993) and references cited therein. Such techniques include 
screening polypeptides for the ability to react with antigen-specific antibodies, antisera 
and/or T-cell lines or clones. As used herein, antisera and antibodies are "antigen- 
specific" if they specifically bind to an antigen (i.e., they react with the protein in an 
ELISA or other immunoassay, and do not react detectably with unrelated proteins). 
Such antisera and antibodies may be prepared as described herein, and using well 
known techniques. An immunogenic portion of a native HSV protein is a portion that 
reacts with such antisera and/or T-cells at a level that is not substantially less than the 
reactivity of the full length polypeptide (e.g., in an ELISA and/or T-cell reactivity 
assay). Such immunogenic portions may react within such assays at a level that is 
similar to or greater than the reactivity of the full length polypeptide. Such screens may 
generally be performed using methods well known to those of ordinary skill in the art, 
such as those described in Harlow and Lane, Antibodies: A Laboratory Manual, Cold 
Spring Harbor Laboratory, 1988. For example, a polypeptide may be immobilized on a 
solid support and contacted with patient sera to allow binding of antibodies within the 
sera to the immobilized polypeptide. Unbound sera may then be removed and bound 
antibodies detected using, for example, 125 I-labeled Protein A. 
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As noted above, a composition may comprise a variant of a native HSV 
protein. A polypeptide 'Variant/' as used herein, is a polypeptide that differs from a 
native HSV protein in one or more substitutions, deletions, additions and/or insertions, 
such that the immunogenicity of the polypeptide is not substantially diminished. In 
other words, the ability of a variant to react with antigen-specific antisera may be 
enhanced or unchanged, relative to the native protein, or may be diminished by less than 
50%, and preferably less than 20%, relative to the native protein. Such variants may 
generally be identified by modifying one of the above polypeptide sequences and 
evaluating the reactivity of the modified polypeptide with antigen-specific antibodies or 
antisera as described herein. Preferred variants include those in which one or more 
portions, such as an N-terminal leader sequence or transmembrane domain, have been 
removed. Other preferred variants include variants in which a small portion (e.g., 1-30 
amino acids, preferably 5-15 amino acids) has been removed from the N- and/or C- 
terminal of the mature protein. 

Polypeptide variants encompassed by the present invention include those 
exhibiting at least about 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 
97%, 98%, or 99% or more identity (determined as described above) to the polypeptides 
disclosed herein. 

Preferably, a variant contains conservative substitutions. A 
"conservative substitution" is one in which an amino acid is substituted for another 
amino acid that has similar properties, such that one skilled in the art of peptide 
chemistry would expect the secondary structure and hydropathic nature of the 
polypeptide to be substantially unchanged. Amino acid substitutions may generally be 
made on the basis of similarity in polarity, charge, solubility, hydrophobicity, 
hydrophilicity and/or the amphipathic nature of the residues. For example, negatively 
charged amino acids include aspartic acid and glutamic acid; positively charged amino 
acids include lysine and arginine; and amino acids with uncharged polar head groups 
having similar hydrophilicity values include leucine, isoleucine and valine; glycine and 
alanine; asparagine and glutamine; and serine, threonine, phenylalanine and tyrosine. 
Other groups of amino acids that may represent conservative changes include: (1) ala, 
pro, gly, glu, asp, gin, asn, ser, thr; (2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; 
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(4) lys, arg, his; and (5) phe, tyr, trp, his. A variant may also, or alternatively, contain 
nonconservative changes. In a preferred embodiment, variant polypeptides differ from a 
native sequence by substitution, deletion or addition of five amino acids or fewer. 
Variants may also (or alternatively) be modified by, for example, the deletion or 
addition of amino acids that have minimal influence on the immunogenicity, secondary 
structure and hydropathic nature of the polypeptide. 

As noted above, polypeptides may comprise a signal (or leader) sequence 
at the N-terminal end of the protein, which co-translationally or post-translationally 
directs transfer of the protein. The polypeptide may also be conjugated to a linker or 
other sequence for ease of synthesis, purification , or identification of the polypeptide 
(e.g., poly-His), or to enhance binding of the polypeptide to a solid support. For 
example, a polypeptide may be conjugated to an immunoglobulin Fc region. 

Polypeptides may be prepared using any of a variety of well known 
techniques. Recombinant polypeptides encoded by DNA sequences as described above 
may be readily prepared from the DNA sequences using any of a variety of expression 
vectors known to those of ordinary skill in the art. Expression may be achieved in any 
appropriate host cell that has been transformed or transfected with an expression vector 
containing a DNA molecule that encodes a recombinant polypeptide. Suitable host cells 
include prokaryotes, yeast, and higher eukaryotic cells, such as mammalian cells and 
plant cells. Preferably, the host cells employed are E. coli, yeast or a mammalian cell 
line such as COS or CHO. Supernatants from suitable host/vector systems which 
secrete recombinant protein or polypeptide into culture media may be first concentrated 
using a commercially available filter. Following concentration, the concentrate may be 
applied to a suitable purification matrix such as an affinity matrix or an ion exchange 
resin. Finally, one or more reverse phase HPLC steps can be employed to further purify 
a recombinant polypeptide. 

Portions and other variants having less than about 100 amino acids, and 
generally less than about 50 amino acids, may also be generated by synthetic means, 
using techniques well known to those of ordinary skill in the art. For example, such 
polypeptides may be synthesized using any of the commercially available solid-phase 
techniques, such as the Merrifield solid-phase synthesis method, where amino acids are 
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sequentially added to a growing amino acid chain. See Merrifield, 1 Am. Chem. Soc. 
55:2149-2146, 1963. Equipment for automated synthesis of polypeptides is 
commercially available from suppliers such as Perkin Elmer/Applied BioSystems 
Division (Foster City, CA), and may be operated according to the manufacturer's 
instructions. 

Within certain specific embodiments, a polypeptide may be a fusion 
protein that comprises multiple polypeptides as described herein, or that comprises at 
least one polypeptide as described herein and an unrelated sequence, such as a known 
protein. A fusion partner may, for example, assist in providing T helper epitopes (an 
immunological fusion partner), preferably T helper epitopes recognized by humans, or 
may assist in expressing the protein (an expression enhancer) at higher yields than the 
native recombinant protein. Certain preferred fusion partners are both immunological 
and expression enhancing fusion partners. Other fusion partners may be selected so as 
to increase the solubility of the protein or to enable the protein to be targeted to desired 
intracellular compartments. Still further fusion partners include affinity tags, which 
facilitate purification of the protein. 

Fusion proteins may generally be prepared using standard techniques, 
including chemical conjugation. Preferably, a fusion protein is expressed as a 
recombinant protein, allowing the production of increased levels, relative to a non-fused 
protein, in an expression system. Briefly, DNA sequences encoding the polypeptide 
components may be assembled separately, and ligated into an appropriate expression 
vector. The 3* end of the DNA sequence encoding one polypeptide component is 
ligated, with or without a peptide linker, to the 5' end of a DNA sequence encoding the 
second polypeptide component so that the reading frames of the sequences are in phase. 
This permits translation into a single fusion protein that retains the biological activity of 
both component polypeptides. 

A peptide linker sequence may be employed to separate the first and 
second polypeptide components by a distance sufficient to ensure that each polypeptide 
folds into its secondary and tertiary structures. Such a peptide linker sequence is 
incorporated into the fusion protein using standard techniques well known in the art. 
Suitable peptide linker sequences may be chosen based on the following factors: 
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(1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a 
secondary structure that could interact with functional epitopes on the first and second 
polypeptides; and (3) the lack of hydrophobic or charged residues that might react with 
the polypeptide functional epitopes. Preferred peptide linker sequences contain Gly, 
Asn and Ser residues. Other near neutral amino acids, such as Thr and Ala may also be 
used in the linker sequence. Amino acid sequences which may be usefully employed as 
linkers include those disclosed in Maratea et al., Gene 40:39-46, 1985; Murphy et al., 
Proc. Natl. Acad Set USA 55:8258-8262, 1986; U.S. Patent No. 4,935,233 and U.S. 
Patent No. 4,751,180. The linker sequence may generally be from 1 to about 50 amino 
acids in length. Linker sequences are not required when the first and second 
polypeptides have non-essential N-terminal amino acid regions that can be used to 
separate the functional domains and prevent steric interference. 

The ligated DNA sequences are operably linked to suitable 
transcriptional or translational regulatory elements. The regulatory elements 
responsible for expression of DNA are located only 5' to the DNA sequence encoding 
the first polypeptides. Similarly, stop codons required to end translation and 
transcription termination signals are only present 3' to the DNA sequence encoding the 
second polypeptide. 

Fusion proteins are also provided. Such proteins comprise a polypeptide 
as described herein together with an unrelated immunogenic protein. Preferably the 
immunogenic protein is capable of eliciting a recall response. Examples of such 
proteins include tetanus, tuberculosis and hepatitis proteins (see, for example, Stoute 
et al. New Engl. J. Med., 35(5:86-91, 1997). 

Within preferred embodiments, an immunological fusion partner is 
derived from protein D, a surface protein of the gram-negative bacterium Haemophilus 
influenza B (WO 91/18926). Preferably, a protein D derivative comprises 
approximately the first third of the protein (e.g., the first N-terminal 100-110 amino 
acids), and a protein D derivative may be lipidated. Within certain preferred 
embodiments, the first 109 residues of a Lipoprotein D fusion partner is included on the 
N-terminus to provide the polypeptide with additional exogenous T-cell epitopes and to 
increase the expression level in E. coli (thus functioning as an expression enhancer). 
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The lipid tail ensures optimal presentation of the antigen to antigen presenting cells. 
Other fusion partners include the non-structural protein from influenzae virus, NS1 
(hemaglutinin). Typically, the N-terminal 81 amino acids are used, although different 
fragments that include T-helper epitopes may be used. 

In another embodiment, a Mycobacterium tuberculosis-derived Ral2 
polynucleotide is linked to at least an immunogenic portion of an HSV polynucleotide 
of this invention. Ral2 compositions and methods for their use in enhancing expression 
of heterologous polynucleotide sequences is described in U.S. Patent Application 
60/158,585, the disclosure of which is incorporated herein by reference in its entirety. 
Briefly, Ral2 refers to a polynucleotide region that is a subsequence of a 
Mycobacterium tuberculosis MTB32A nucleic acid. MTB32A is a serine protease of 32 
KD molecular weight encoded by a gene in virulent and avirulent strains of M 
tuberculosis. The nucleotide sequence and amino acid sequence of MTB32A have been 
disclosed (U.S. Patent Application 60/158,585; see also, Skeiky et aL, Infection and 
Immun. (1999) 67:3998-4007, incorporated herein by reference). The Ral2 C-terminal 
fragment of the MTB32A coding sequence expresses at high levels on its own and 
remains as a soluble protein throughout the purification process. Moreover, the 
presence of Ral2 polypeptide fragments in a fusion polypeptide may enhance the 
immunogenicity of the heterologous antigenic HSV polypeptides with which Ral2 is 
fused. In one embodiment, the Ral2 polypeptide sequence present in a fusion 
polypeptide with an HSV antigen comprises some or all of amino acid residues 192 to 
323 of MTB32A. 

In another embodiment, the immunological fusion partner is the protein 
known as LYTA, or a portion thereof (preferably a C-terminal portion). LYTA is 
derived from Streptococcus pneumoniae, which synthesizes an N-acetyl-L-alanine 
amidase known as amidase LYTA (encoded by the LytA gene; Gene 45:265-292, 1986). 
LYTA is an autolysin that specifically degrades certain bonds in the peptidoglycan 
backbone. The C-terminal domain of the LYTA protein is responsible for the affinity to 
the choline or to some choline analogues such as DEAE. This property has been 
exploited for the development of E. coli C-LYTA expressing plasmids useful for 
expression of fusion proteins. Purification of hybrid proteins containing the C-LYTA 
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fragment at the amino terminus has been described {see Biotechnology 70:795-798, 
1992). Within a preferred embodiment, a repeat portion of LYTA may be incorporated 
into a fusion protein. A repeat portion is found in the C-terminal region starting at 
residue 178. A particularly preferred repeat portion incorporates residues 1 88-305. 

In general, polypeptides (including fusion proteins) and polynucleotides 
as described herein are isolated. An "isolated" polypeptide or polynucleotide is one that 
is removed from its original environment. For example, a naturally-occurring protein is 
isolated if it is separated from some or all of the coexisting materials in the natural 
system. Preferably, such polypeptides are at least about 90% pure, more preferably at 
least about 95% pure and most preferably at least about 99% pure. A polynucleotide is 
considered to be isolated if, for example, it is cloned into a vector that is not a part of 
the natural environment. 

Binding Agents 

The present invention further provides agents, such as antibodies and 
antigen-binding fragments thereof, that specifically bind to a HSV protein. As used 
herein, an antibody, or antigen-binding fragment thereof, is said to "specifically bind" to 
a HSV protein if it reacts at a detectable level (within, for example, an ELISA) with a 
HSV protein, and does not react detectably with unrelated proteins under similar 
conditions. As used herein, "binding" refers to a noncovalent association between two 
separate molecules such that a complex is formed. The ability to bind may be evaluated 
by, for example, determining a binding constant for the formation of the complex. The 
binding constant is the value obtained when the concentration of the complex is divided 
by the product of the component concentrations. In general, two compounds are said to 
"bind," in the context of the present invention, when the binding constant for complex 
formation exceeds about 10 3 L/mol. The binding constant may be determined using 
methods well known in the art. 

Binding agents may be further capable of differentiating between patients 
with and without HSV infection using the representative assays provided herein. For 
example, preferably, antibodies or other binding agents that bind to a HSV protein will 
generate a signal indicating the presence of infection in at least about 20% of patients 
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with the disease, and will generate a negative signal indicating the absence of the 
disease in at least about 90% of individuals without an HSV infection. To determine 
whether a binding agent satisfies this requirement, biological samples (e.g., blood, sera, 
sputum, urine and/or biopsies) from patients with and without HSV (as determined 
using standard clinical tests) may be assayed as described herein for the presence of 
polypeptides that bind to the binding agent. It will be apparent that a statistically 
significant number of samples with and without the disease should be assayed. Each 
binding agent should satisfy the above criteria; however, those of ordinary skill in the 
art will recognize that binding agents may be used in combination to improve 
sensitivity. 

Any agent that satisfies the above requirements may be a binding agent. 
For example, a binding agent may be a ribosome, with or without a peptide component, 
an RNA molecule or a polypeptide. In a preferred embodiment, a binding agent is an 
antibody or an antigen-binding fragment thereof. Antibodies may be prepared by any of 
a variety of techniques known to those of ordinary skill in the art. See, e.g., Harlow and 
Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In 
general, antibodies can be produced by cell culture techniques, including the generation 
of monoclonal antibodies as described herein, or via transfection of antibody genes into 
suitable bacterial or mammalian cell hosts, in order to allow for the production of 
recombinant antibodies. In one technique, an immunogen comprising the polypeptide is 
initially injected into any of a wide variety of mammals (e.g., mice, rats, rabbits, sheep 
or goats). In this step, the polypeptides of this invention may serve as the immunogen 
without modification. Alternatively, particularly for relatively short polypeptides, a 
superior immune response may be elicited if the polypeptide is joined to a carrier 
protein, such as bovine serum albumin or keyhole limpet hemocyanin. The immunogen 
is injected into the animal host, preferably according to a predetermined schedule 
incorporating one or more booster immunizations, and the animals are bled periodically. 
Polyclonal antibodies specific for the polypeptide may then be purified from such 
antisera by, for example, affinity chromatography using the polypeptide coupled to a 
suitable solid support. 

Monoclonal antibodies specific for an antigenic polypeptide of interest 
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may be prepared, for example, using the technique of Kohler and Milstein, Eur. J. 
Immunol (5:511-519, 1976, and improvements thereto. Briefly, these methods involve 
the preparation of immortal cell lines capable of producing antibodies having the 
desired specificity (ie. 9 reactivity with the polypeptide of interest). Such cell lines may 
be produced, for example, from spleen cells obtained from an animal immunized as 
described above. The spleen cells are then immortalized by, for example, fusion with a 
myeloma cell fusion partner, preferably one that is syngeneic with the immunized 
animal. A variety of fusion techniques may be employed. For example, the spleen cells 
and myeloma cells may be combined with a nonionic detergent for a few minutes and 
then plated at low density on a selective medium that supports the growth of hybrid 
cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine, 
aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks, 
colonies of hybrids are observed. Single colonies are selected and their culture 
supernatants tested for binding activity against the polypeptide. Hybridomas having 
high reactivity and specificity are preferred. 

Monoclonal antibodies may be isolated from the supernatants of growing 
hybridoma colonies. In addition, various techniques may be employed to enhance the 
yield, such as injection of the hybridoma cell line into the peritoneal cavity of a suitable 
vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from 
the ascites fluid or the blood. Contaminants may be removed from the antibodies by 
conventional techniques, such as chromatography, gel filtration, precipitation, and 
extraction. The polypeptides of this invention may be used in the purification process 
in, for example, an affinity chromatography step. 

Within certain embodiments, the use of antigen-binding fragments of 
antibodies may be preferred. Such fragments include Fab fragments, which may be 
prepared using standard techniques. Briefly, immunoglobulins may be purified from 
rabbit serum by affinity chromatography on Protein A bead columns (Harlow and Lane, 
Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988) and digested 
by papain to yield Fab and Fc fragments. The Fab and Fc fragments may be separated 
by affinity chromatography on protein A bead columns. 

Monoclonal antibodies of the present invention may be coupled to one or 
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more therapeutic agents. Suitable agents in this regard include radionuclides, 
differentiation inducers, drugs, toxins, and derivatives thereof. Preferred radionuclides 
include ^Y, 123 I, 125 I, 131 I, 186 Re, 188 Re, 2n At, and 212 Bi. Preferred drugs include 
methotrexate, and pyrimidine and purine analogs. Preferred differentiation inducers 
include phorbol esters and butyric acid. Preferred toxins include ricin, abrin, diptheria 
toxin, cholera toxin, gelonin, Pseudomonas exotoxin, Shigella toxin, and pokeweed 
antiviral protein. 

A therapeutic agent may be coupled (e.g., covalently bonded) to a 
suitable monoclonal antibody either directly or indirectly (e.g., via a linker group). A 
direct reaction between an agent and an antibody is possible when each possesses a 
substituent capable of reacting with the other. For example, a nucleophilic group, such 
as an amino or sulfhydryl group, on one may be capable of reacting with a carbonyl- 
containing group, such as an anhydride or an acid halide, or with an alkyl group 
containing a good leaving group (e.g., a halide) on the other. 

Alternatively, it may be desirable to couple a therapeutic agent and an 
antibody via a linker group. A linker group can function as a spacer to distance an 
antibody from an agent in order to avoid interference with binding capabilities. A linker 
group can also serve to increase the chemical reactivity of a substituent on an agent or 
an antibody, and thus increase the coupling efficiency. An increase in chemical 
reactivity may also facilitate the use of agents, or functional groups on agents, which 
otherwise would not be possible. 

It will be evident to those skilled in the art that a variety of Afunctional 
or polyfunctional reagents, both homo- and hetero-functional (such as those described in 
the catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker 
group. Coupling may be effected, for example, through amino groups, carboxyl groups, 
sulfhydryl groups or oxidized carbohydrate residues. There are numerous references 
describing such methodology, e.g., U.S. Patent No. 4,671,958, to Rodwell et al. 

Where a therapeutic agent is more potent when free from the antibody 
portion of the immunoconjugates of the present invention, it may be desirable to use a 
linker group which is cleavable during or upon internalization into a cell. A number of 
different cleavable linker groups have been described. The mechanisms for the 
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intracellular release of an agent from these linker groups include cleavage by reduction 
of a disulfide bond (e.g., U.S. Patent No. 4,489,710, to Spitler), by irradiation of a 
photolabile bond (e.g., U.S. Patent No. 4,625,014, to Senter etal.), by hydrolysis of 
derivatized amino acid side chains (e.g., U.S. Patent No. 4,638,045, to Kohn et al.), by 
serum complement-mediated hydrolysis (e.g., U.S. Patent No. 4,671,958, to Rodwell 
et al.), and acid-catalyzed hydrolysis (e.g., U.S. Patent No. 4,569,789, to Blattler et al.). 

It may be desirable to couple more than one agent to an antibody. In one 
embodiment, multiple molecules of an agent are coupled to one antibody molecule. In 
another embodiment, more than one type of agent may be coupled to one antibody. 
Regardless of the particular embodiment, immunoconjugates with more than one agent 
may be prepared in a variety of ways. For example, more than one agent may be 
coupled directly to an antibody molecule, or linkers that provide multiple sites for 
attachment can be used. Alternatively, a carrier can be used. 

A carrier may bear the agents in a variety of ways, including covalent 
bonding either directly or via a linker group. Suitable carriers include proteins such as 
albumins (e.g., U.S. Patent No. 4,507,234, to Kato et al.), peptides and polysaccharides 
such as aminodextran (e.g., U.S. Patent No. 4,699,784, to Shih et al.). A carrier may 
also bear an agent by noncovalent bonding or by encapsulation, such as within a 
liposome vesicle (e.g., U.S. Patent Nos. 4,429,008 and 4,873,088). Carriers specific for 
radionuclide agents include radiohalogenated small molecules and chelating 
compounds. For example, U.S. Patent No. 4,735,792 discloses representative 
radiohalogenated small molecules and their synthesis. A radionuclide chelate may be 
formed from chelating compounds that include those containing nitrogen and sulfur 
atoms as the donor atoms for binding the metal, or metal oxide, radionuclide. For 
example, U.S. Patent No. 4,673,562, to Davison et al. discloses representative chelating 
compounds and their synthesis. 

A variety of routes of administration for the antibodies and 
immunoconjugates may be used. Typically, administration will be intravenous, 
intramuscular, subcutaneous and the like. It will be evident that the precise dose of the 
antibody/immunoconjugate will vary depending upon the antibody used, the antigen 
density, and the rate of clearance of the antibody. 
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T Cells 

Immunotherapeutic compositions may also, or alternatively, comprise T 
cells specific for HS V protein. Such cells may generally be prepared in vitro or ex vivo, 
using standard procedures. For example, T cells may be isolated from bone marrow, 
peripheral blood, or a fraction of bone marrow or peripheral blood of a patient, using a 
commercially available cell separation system, such as the Isolex™ System, available 
from Nexell Therapeutics, Inc. (Irvine, CA; see also U.S. Patent No. 530,856; U.S. 
Patent No. 5,215,926; WO 89/06280; WO 91/161 16 and WO 92/07243). Alternatively, 
T cells may be derived from related or unrelated humans, non-human mammals, cell 
lines or cultures. 

T cells may be stimulated with a HSV polypeptide, polynucleotide 
encoding a HSV polypeptide and/or an antigen presenting cell (APC) that expresses 
such a polypeptide. Such stimulation is performed under conditions and for a time 
sufficient to permit the generation of T cells that are specific for the polypeptide. In 
certain embodiments, HSV polypeptide or polynucleotide is present within a delivery 
vehicle, such as a microsphere, to facilitate the generation of specific T cells. 

T cells are considered to be specific for a HSV polypeptide if the T cells 
specifically proliferate, secrete cytokines or kill target cells coated with the polypeptide 
or expressing a gene encoding the polypeptide. T cell specificity may be evaluated 
using any of a variety of standard techniques. For example, within a chromium release 
assay or proliferation assay, a stimulation index of more than two fold increase in lysis 
and/or proliferation, compared to negative controls, indicates T cell specificity. Such 
assays may be performed, for example, as described in Chen et al., Cancer Res. 
54:1065-1070, 1994. Alternatively, detection of the proliferation of T cells may be 
accomplished by a variety of known techniques. For example, T cell proliferation can 
be detected by measuring an increased rate of DNA synthesis (e.g., by pulse-labeling 
cultures of T cells with tritiated thymidine and measuring the amount of tritiated 
thymidine incorporated into DNA). Contact with a HSV polypeptide (100 ng/ml - 100 
|ag/ml, preferably 200 ng/ml - 25 |ig/ml) for 3 - 7 days should result in at least a two 
fold increase in proliferation of the T cells. Contact as described above for 2-3 hours 
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should result in activation of the T cells, as measured using standard cytokine assays in 
which a two fold increase in the level of cytokine release (e.g., TNF or IFN-y) is 
indicative of T cell activation (see Coligan et al., Current Protocols in Immunology, vol. 
1, Wiley Interscience (Greene 1998)). T cells that have been activated in response to a 
HSV polypeptide, polynucleotide or polypeptide-expressing APC may be CD4 + and/or 
CD8 + . HSV protein-specific T cells may be expanded using standard techniques. 
Within preferred embodiments, the T cells are derived from a patient, a related donor or 
an unrelated donor, and are administered to the patient following stimulation and 
expansion. 

For therapeutic purposes, CD4+ or CD8 + T cells that proliferate in 
response to a HSV polypeptide, polynucleotide or APC can be expanded in number 
either in vitro or in vivo. Proliferation of such T cells in vitro may be accomplished in a 
variety of ways. For example, the T cells can be re-exposed to a HSV polypeptide, or a 
short peptide corresponding to an immunogenic portion of such a polypeptide, with or 
without the addition of T cell growth factors, such as interleukin-2, and/or stimulator 
cells that synthesize a HSV polypeptide. Alternatively, one or more T cells that 
proliferate in the presence of a HSV protein can be expanded in number by cloning. 
Methods for cloning cells are well known in the art, and include limiting dilution. 

Pharmaceutical Compositions 

In additional embodiments, the present invention concerns formulation 
of one or more of the polynucleotide, polypeptide, T-cell and/or antibody compositions 
disclosed herein in pharmaceutically-acceptable solutions for administration to a cell or 
an animal, either alone, or in combination with one or more other modalities of therapy. 

It will also be understood that, if desired, the nucleic acid segment, RNA, 
DNA or PNA compositions that express a polypeptide as disclosed herein may be 
administered in combination with other agents as well, such as, e.g., other proteins or 
polypeptides or various pharmaceutically-active agents. In fact, there is virtually no 
limit to other components that may also be included, given that the additional agents do 
not cause a significant adverse effect upon contact with the target cells or host tissues. 
The compositions may thus be delivered along with various other agents as required in 
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the particular instance. Such compositions may be purified from host cells or other 
biological sources, or alternatively may be chemically synthesized as described herein. 
Likewise, such compositions may further comprise substituted or derivatized RNA or 
DNA compositions. ' 

Formulation of pharmaceutically-acceptable excipients and carrier 
solutions is well-known to those of skill in the art, as is the development of suitable 
dosing and treatment regimens for using the particular compositions described herein in 
a variety of treatment regimens, including e.g., oral, parenteral, intravenous, intranasal, 
and intramuscular administration and formulation. 

1. Oral Delivery 

In certain applications, the pharmaceutical compositions disclosed herein 
may be delivered via oral administration to an animal. As such, these compositions 
may be formulated with an inert diluent or with an assimilable edible carrier, or they 
may be enclosed in hard- or soft-shell gelatin capsule, or they may be compressed into 
tablets, or they may be incorporated directly with the food of the diet. 

The active compounds may even be incorporated with excipients and 
used in the form of ingestible tablets, buccal tables, troches, capsules, elixirs, 
suspensions, syrups, wafers, and the like (Mathiowitz et aL, 1991 \ Hwang et aL, 1998; 
U. S. Patent 5,641,515; U. S. Patent 5,580,579 and U. S. Patent 5,792,451, each 
specifically incorporated herein by reference in its entirety). The tablets, troches, pills, 
capsules and the like may also contain the following: a binder, as gum tragacanth, 
acacia, cornstarch, or gelatin; excipients, such as dicalcium phosphate; a disintegrating 
agent, such as corn starch, potato starch, alginic acid and the like; a lubricant, such as 
magnesium stearate; and a sweetening agent, such as sucrose, lactose or saccharin may 
be added or a flavoring agent, such as peppermint, oil of wintergreen, or cherry 
flavoring. When the dosage unit form is a capsule, it may contain, in addition to 
materials of the above type, a liquid carrier. Various other materials may be present as 
coatings or to otherwise modify the physical form of the dosage unit. For instance, 
tablets, pills, or capsules may be coated with shellac, sugar, or both. A syrup of elixir 
may contain the active compound sucrose as a sweetening agent methyl and 
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propylparabens as preservatives, a dye and flavoring, such as cherry or orange flavor. 
Of course, any material used in preparing any dosage unit form should be 
pharmaceutical^ pure and substantially non-toxic in the amounts employed. In 
addition, the active compounds may be incorporated into sustained-release preparation 
and formulations. 

Typically, these formulations may contain at least about 0.1% of the 
active compound or more, although the percentage of the active ingredient(s) may, of 
course, be varied and may conveniently be between about 1 or 2% and about 60% or 
70% or more of the weight or volume of the total formulation. Naturally, the amount of 
active compound(s) in each therapeutically useful composition may be prepared is such 
a way that a suitable dosage will be obtained in any given unit dose of the compound. 
Factors such as solubility, bioavailability, biological half-life, route of administration, 
product shelf life, as well as other pharmacological considerations will be contemplated 
by one skilled in the art of preparing such pharmaceutical formulations, and as such, a 
variety of dosages and treatment regimens may be desirable. 

For oral administration the compositions of the present invention may 
alternatively be incorporated with one or more excipients in the form of a mouthwash, 
dentifrice, buccal tablet, oral spray, or sublingual orally-administered formulation. For 
example, a mouthwash may be prepared incorporating the active ingredient in the 
required amount in an appropriate solvent, such as a sodium borate solution (Dobell's 
Solution). Alternatively, the active ingredient may be incorporated into an oral solution 
such as one containing sodium borate, glycerin and potassium bicarbonate, or dispersed 
in a dentifrice, or added in a therapeutically-effective amount to a composition that may 
include water, binders, abrasives, flavoring agents, foaming agents, and humectants. 
Alternatively the compositions may be fashioned into a tablet or solution form that may 
be placed under the tongue or otherwise dissolved in the mouth. 

2. Injectable Delivery 

In certain circumstances it will be desirable to deliver the pharmaceutical 
compositions disclosed herein parenterally, intravenously, intramuscularly, or even 
intraperitoneally as described in U. S. Patent 5,543,158; U. S. Patent 5,641,515 and U. 
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S. Patent 5,399,363 (each specifically incorporated herein by reference in its entirety). 
Solutions of the active compounds as free base or pharmacologically acceptable salts 
may be prepared in water suitably mixed with a surfactant, such as 
hydroxypropylcellulose. Dispersions may also be prepared in glycerol, liquid 
polyethylene glycols, and mixtures thereof and in oils. Under ordinary conditions of 
storage and use, these preparations contain a preservative to prevent the growth of 
microorganisms. 

The pharmaceutical forms suitable for injectable use include sterile 
aqueous solutions or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersions (U. S. Patent 5,466,468, 
specifically incorporated herein by reference in its entirety). In all cases the form must 
be sterile and must be fluid to the extent that easy syringability exists. It must be stable 
under the conditions of manufacture and storage and must be preserved against the 
contaminating action of microorganisms, such as bacteria and fungi. The carrier can be 
a solvent or dispersion medium containing, for example, water, ethanol, polyol {e.g., 
glycerol, propylene glycol, and liquid polyethylene glycol, and the like), suitable 
mixtures thereof, and/or vegetable oils. Proper fluidity may be maintained, for example, 
by the use of a coating, such as lecithin, by the maintenance of the required particle size 
in the case of dispersion and by the use of surfactants. The prevention of the action of 
microorganisms can be facilitated by various antibacterial and antifungal agents, for 
example, parabens, chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In 
many cases, it will be preferable to include isotonic agents, for example, sugars or 
sodium chloride. Prolonged absorption of the injectable compositions can be brought 
about by the use in the compositions of agents delaying absorption, for example, 
aluminum monostearate and gelatin. 

For parenteral administration in an aqueous solution, for example, the 
solution should be suitably buffered if necessary and the liquid diluent first rendered 
isotonic with sufficient saline or glucose. These particular aqueous solutions are 
especially suitable for intravenous, intramuscular, subcutaneous and intraperitoneal 
administration. In this connection, a sterile aqueous medium that can be employed will 
be known to those of skill in the art in light of the present disclosure. For example, one 
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dosage may be dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml 
of hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, 
"Remington's Pharmaceutical Sciences" 15th Edition, pages 1035-1038 and 1570- 
1580). Some variation in dosage will necessarily occur depending on the condition of 
the subject being treated. The person responsible for administration will, in any event, 
determine the appropriate dose for the individual subject. Moreover, for human 
administration, preparations should meet sterility, pyrogenicity, and the general safety 
and purity standards as required by FDA Office of Biologies standards. 

Sterile injectable solutions are prepared by incorporating the active 
compounds in the required amount in the appropriate solvent with various of the other 
ingredients enumerated above, as required, followed by filtered sterilization. Generally, 
dispersions are prepared by incorporating the various sterilized active ingredients into a 
sterile vehicle which contains the basic dispersion medium and the required other 
ingredients from those enumerated above. In the case of sterile powders for the 
preparation of sterile injectable solutions, the preferred methods of preparation are 
vacuum-drying and freeze-drying techniques which yield a powder of the active 
ingredient plus any additional desired ingredient from a previously sterile-filtered 
solution thereof. 

The compositions disclosed herein may be formulated in a neutral or salt 
form. Pharmaceutically-acceptable salts, include the acid addition salts (formed with 
the free amino groups of the protein) and which are formed with inorganic acids such 
as, for example, hydrochloric or phosphoric acids, or such organic acids as acetic, 
oxalic, tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can 
also be derived from inorganic bases such as, for example, sodium, potassium, 
ammonium, calcium, or ferric hydroxides, and such organic bases as isopropylamine, 
trimethylamine, histidine, procaine and the like. Upon formulation, solutions will be 
administered in a manner compatible with the dosage formulation and in such amount 
as is therapeutically effective. The formulations are easily administered in a variety of 
dosage forms such as injectable solutions, drug-release capsules, and the like. 

As used herein, "carrier" includes any and all solvents, dispersion media, 
vehicles, coatings, diluents, antibacterial and antifungal agents, isotonic and absorption 
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delaying agents, buffers, carrier solutions, suspensions, colloids, and the like. The use 
of such media and agents for pharmaceutical active substances is well known in the art. 
Except insofar as any conventional media or agent is incompatible with the active 
ingredient, its use in the therapeutic compositions is contemplated. Supplementary 
active ingredients can also be incorporated into the compositions. 

The phrase "pharmaceutically-acceptable" refers to molecular entities 
and compositions that do not produce an allergic or similar untoward reaction when 
administered to a human. The preparation of an aqueous composition that contains a 
protein as an active ingredient is well understood in the art. Typically, such 
compositions are prepared as injectables, either as liquid solutions or suspensions; solid 
forms suitable for solution in, or suspension in, liquid prior to injection can also be 
prepared. The preparation can also be emulsified. 

3. Nasal Delivery 

In certain embodiments, the pharmaceutical compositions may be 
delivered by intranasal sprays, inhalation, and/or other aerosol delivery vehicles. 
Methods for delivering genes, nucleic acids, and peptide compositions directly to the 
lungs via nasal aerosol sprays has been described e.g., in U. S. Patent 5,756,353 and U. 
S. Patent 5,804,212 (each specifically incorporated herein by reference in its entirety). 
Likewise, the delivery of drugs using intranasal microparticle resins (Takenaga et aL, 
1998) and lysophosphatidyl-glycerol compounds (U. S. Patent 5,725,871, specifically 
incorporated herein by reference in its entirety) are also well-known in the 
pharmaceutical arts, Likewise, transmucosal drug delivery in the form of a 
polytetrafluoroetheylene support matrix is described in U. S. Patent 5,780,045 
(specifically incorporated herein by reference in its entirety). 

4. Liposome-, Nanocapsule-, and Microparticle-Mediated Delivery 
In certain embodiments, the inventors contemplate the use of liposomes, 

nanocapsules, microparticles, microspheres, lipid particles, vesicles, and the like, for the 
introduction of the compositions of the present invention into suitable host cells. In 
particular, the compositions of the present invention may be formulated for delivery 
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either encapsulated in a lipid particle, a liposome, a vesicle, a nanosphere, or a 
nanoparticle or the like. 

Such formulations may be preferred for the introduction of 
pharmaceutically-acceptable formulations of the nucleic acids or constructs disclosed 
herein. The formation and use of liposomes is generally known to those of skill in the 
art (see for example, Couvreur et al, 1977; Couvreur, 1988; Lasic, 1998; which 
describes the use of liposomes and nanocapsules in the targeted antibiotic therapy for 
intracellular bacterial infections and diseases). Recently, liposomes were developed 
with improved serum stability and circulation half-times (Gabizon and 
Papahadjopoulos, 1988; Allen and Choun, 1987; U. S. Patent 5,741,516, specifically 
incorporated herein by reference in its entirety). Further, various methods of liposome 
and liposome like preparations as potential drug carriers have been reviewed (Takakura, 
1998; Chandran et al, 1997; Margalit, 1995; U. S. Patent 5,567,434; U. S. Patent 
5,552,157; U. S. Patent 5,565,213; U. S. Patent 5,738,868 and U. S. Patent 5,795,587, 
each specifically incorporated herein by reference in its entirety). 

Liposomes have been used successfully with a number of cell types that 
are normally resistant to transfection by other procedures including T cell suspensions, 
primary hepatocyte cultures and PC 12 cells (Renneisen et al, 1990; Muller et al> 
1990). In addition, liposomes are free of the DNA length constraints that are typical of 
viral-based delivery systems. Liposomes have been used effectively to introduce genes, 
drugs (Heath and Martin, 1986; Heath etal, 1986; Balazsovits etal, 1989; Fresta and 
Puglisi, 1996), radiotherapeutic agents (Pikul etal, 1987), enzymes (Imaizumi etal, 
1990a; Imaizumi etal, 1990b), viruses (Faller and Baltimore, 1984), transcription 
factors and allosteric effectors (Nicolau and Gersonde, 1979) into a variety of cultured 
cell lines and animals. In addition, several successful clinical trails examining the 
effectiveness of liposome-mediated drug delivery have been completed (Lopez- 
Beresteinef a/., 1985a; 1985b; Coune, 1988; Soulier et al, 1988). Furthermore, several 
studies suggest that the use of liposomes is not associated with autoimmune responses, 
toxicity or gonadal localization after systemic delivery (Mori and Fukatsu, 1992). 

Liposomes are formed from phospholipids that are dispersed in an 
aqueous medium and spontaneously form multilamellar concentric bilayer vesicles (also 
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termed multilamellar vesicles (MLVs). MLVs generally have diameters of from 25 nm 
to 4 |Lim. Sonication of MLVs results in the formation of small unilamellar vesicles 
(SUVs) with diameters in the range of 200 to 500 A, containing an aqueous solution in 
the core. 

Liposomes bear resemblance to cellular membranes and are 
contemplated for use in connection with the present invention as carriers for the peptide 
compositions. They are widely suitable as both water- and lipid-soluble substances can 
be entrapped, i.e. in the aqueous spaces and within the bilayer itself, respectively. It is 
possible that the drug-bearing liposomes may even be employed for site-specific 
delivery of active agents by selectively modifying the liposomal formulation. 

In addition to the teachings of Couvreur et al (1977; 1988), the 
following information may be utilized in generating liposomal formulations. 
Phospholipids can form a variety of structures other than liposomes when dispersed in 
water, depending on the molar ratio of lipid to water. At low ratios the liposome is the 
preferred structure. The physical characteristics of liposomes depend on pH, ionic 
strength and the presence of divalent cations. Liposomes can show low permeability to 
ionic and polar substances, but at elevated temperatures undergo a phase transition 
which markedly alters their permeability. The phase transition involves a change from a 
closely packed, ordered structure, known as the gel state, to a loosely packed, less- 
ordered structure, known as the fluid state. This occurs at a characteristic phase- 
transition temperature and results in an increase in permeability to ions, sugars and 
drugs. 

In addition to temperature, exposure to proteins can alter the 
permeability of liposomes. Certain soluble proteins, such as cytochrome c, bind, 
deform and penetrate the bilayer, thereby causing changes in permeability. Cholesterol 
inhibits this penetration of proteins, apparently by packing the phospholipids more 
tightly. It is contemplated that the most useful liposome formations for antibiotic and 
inhibitor delivery will contain cholesterol. 

The ability to trap solutes varies between different types of liposomes. 
For example, MLVs are moderately efficient at trapping solutes, but SUVs are 
extremely inefficient. SUVs offer the advantage of homogeneity and reproducibility in 
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size distribution, however, and a compromise between size and trapping efficiency is 
offered by large unilamellar vesicles (LUVs). These are prepared by ether evaporation 
and are three to four times more efficient at solute entrapment than MLVs. 

In addition to liposome characteristics, an important determinant in 
entrapping compounds is the physicochemical properties of the compound itself. Polar 
compounds are trapped in the aqueous spaces and nonpolar compounds bind to the lipid 
bilayer of the vesicle. Polar compounds are released through permeation or when the 
bilayer is broken, but nonpolar compounds remain affiliated with the bilayer unless it is 
disrupted by temperature or exposure to lipoproteins. Both types show maximum efflux 
rates at the phase transition temperature. 

Liposomes interact with cells via four different mechanisms: 
endocytosis by phagocytic cells of the reticuloendothelial system such as macrophages 
and neutrophils; adsorption to the cell surface, either by nonspecific weak hydrophobic 
or electrostatic forces, or by specific interactions with cell-surface components; fusion 
with the plasma cell membrane by insertion of the lipid bilayer of the liposome into the 
plasma membrane, with simultaneous release of liposomal contents into the cytoplasm; 
and by transfer of liposomal lipids to cellular or subcellular membranes, or vice versa, 
without any association of the liposome contents. It often is difficult to determine 
which mechanism is operative and more than one may operate at the same time. 

The fate and disposition of intravenously injected liposomes depend on 
their physical properties, such as size, fluidity, and surface charge. They may persist in 
tissues for h or days, depending on their composition, and half lives in the blood range 
from min to several h. Larger liposomes, such as MLVs and LUVs, are taken up rapidly 
by phagocytic cells of the reticuloendothelial system, but physiology of the circulatory 
system restrains the exit of such large species at most sites. They can exit only in places 
where large openings or pores exist in the capillary endothelium, such as the sinusoids 
of the liver or spleen. Thus, these organs are the predominate site of uptake. On the 
other hand, SUVs show a broader tissue distribution but still are sequestered highly in 
the liver and spleen. In general, this in vivo behavior limits the potential targeting of 
liposomes to only those organs and tissues accessible to their large size. These include 
the blood, liver, spleen, bone marrow, and lymphoid organs. 
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Targeting is generally not a limitation in terms of the present invention. 
However, should specific targeting be desired, methods are available for this to be 
accomplished. Antibodies may be used to bind to the liposome surface and to direct the 
antibody and its drug contents to specific antigenic receptors located on a particular 
cell-type surface. Carbohydrate determinants (glycoprotein or glycolipid cell-surface 
components that play a role in cell-cell recognition, interaction and adhesion) may also 
be used as recognition sites as they have potential in directing liposomes to particular 
cell types. Mostly, it is contemplated that intravenous injection of liposomal 
preparations would be used, but other routes of administration are also conceivable. 

Alternatively, the invention provides for pharmaceutically-acceptable 
nanocapsule formulations of the compositions of the present invention. Nanocapsules 
can generally entrap compounds in a stable and reproducible way (Henry-Michelland 
etal, 1987; Quintanar-Guerrero et ah, 1998; Douglas et ah, 1987). To avoid side 
effects due to intracellular polymeric overloading, such ultrafine particles (sized around 
0.1 pm) should be designed using polymers able to be degraded in vivo. Biodegradable 
polyalkyl-cyanoacrylate nanoparticles that meet these requirements are contemplated for 
use in the present invention. Such particles may be are easily made, as described 
(Couvreur etal, 1980; 1988; zur Muhlen et ah, 1998; Zambaux et ah 1998; Pinto- 
Alphandry et ah, 1995 and U. S. Patent 5,145,684, specifically incorporated herein by 
reference in its entirety). 

Vaccines 

In certain preferred embodiments of the present invention, vaccines are 
provided. The vaccines will generally comprise one or more pharmaceutical 
compositions, such as those discussed above, in combination with an immunostimulant. 
An immunostimulant may be any substance that enhances or potentiates an immune 
response (antibody and/or cell-mediated) to an exogenous antigen. Examples of 
immunostimulants include adjuvants, biodegradable microspheres {e.g., polylactic 
galactide) and liposomes (into which the compound is incorporated; see e.g., Fullerton, 
U.S. Patent No. 4,235,877). Vaccine preparation is generally described in, for example, 
M.F. Powell and M.J. Newman, eds., "Vaccine Design (the subunit and adjuvant 
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approach)," Plenum Press (NY, 1995). Pharmaceutical compositions and vaccines 
within the scope of the present invention may also contain other compounds, which may 
be biologically active or inactive. For example, one or more immunogenic portions of 
other HS V antigens may be present, either incorporated into a fusion polypeptide or as a 
separate compound, within the composition or vaccine. 

Illustrative vaccines may contain DNA encoding one or more of the 
polypeptides as described above, such that the polypeptide is generated in situ. As 
noted above, the DNA may be present within any of a variety of delivery systems 
known to those of ordinary skill in the art, including nucleic acid expression systems, 
bacteria and viral expression systems. Numerous gene delivery techniques are well 
known in the art, such as those described by Rolland, Crit. Rev. Therap. Drug Carrier 
Systems 75:143-198, 1998, and references cited therein. Appropriate nucleic acid 
expression systems contain the necessary DNA sequences for expression in the patient 
(such as a suitable promoter and terminating signal). Bacterial delivery systems involve 
the administration of a bacterium (such as Bacillus-Calmette-Guerriri) that expresses an 
immunogenic portion of the polypeptide on its cell surface or secretes such an epitope. 
In a preferred embodiment, the DNA may be introduced using a viral expression system 
{e.g., vaccinia or other pox virus, retrovirus, or adenovirus), which may involve the use 
of a non-pathogenic (defective), replication competent virus. Suitable systems are 
disclosed, for example, in Fisher-Hoch et al., Proc. Natl. Acad Sci. USA 55:317-321, 
1989; Flexner et al., Ann. N.Y. Acad. Sci. 5<5P:86-103, 1989; Flexner et al., Vaccine 
5:17-21, 1990; U.S. Patent Nos. 4,603,112, 4,769,330, and 5,017,487; WO 89/01973; 
U.S. Patent No. 4,777,127; GB 2,200,651; EP 0,345,242; WO 91/02805; Berkner, 
Biotechniques 6:616-627, 1988; Rosenfeld et al., Science 252:431-434, 1991; Kolls et 
al., Proc. Natl. Acad Sci. USA Pi:215-219, 1994; Kass-Eisler et al., Proc. Natl Acad 
Sci. USA P0:11498-115O2, 1993; Guzman et al., Circulation 55:2838-2848, 1993; and 
Guzman et al., Cir. Res. 73:1202-1207, 1993. Techniques for incorporating DNA into 
such expression systems are well known to those of ordinary skill in the art. The DNA 
may also be "naked," as described, for example, in Ulmer et al., Science 259: 1745- 1749, 
1993 and reviewed by Cohen, Science 2JP:1691-1692, 1993. The uptake of naked 
DNA may be increased by coating the DNA onto biodegradable beads, which are 
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efficiently transported into the cells. It will be apparent that a vaccine may comprise 
both a polynucleotide and a polypeptide component. Such vaccines may provide for an 
enhanced immune response. 

It will be apparent that a vaccine may contain pharmaceutical^ 
acceptable salts of the polynucleotides and polypeptides provided herein. Such salts 
may be prepared from pharmaceutical^ acceptable non-toxic bases, including organic 
bases (e.g., salts of primary, secondary and tertiary amines and basic amino acids) and 
inorganic bases (e.g., sodium, potassium, lithium, ammonium, calcium and magnesium 
salts). 

While any suitable carrier known to those of ordinary skill in the art may 
be employed in the vaccine compositions of this invention, the type of carrier will vary 
depending on the mode of administration. Compositions of the present invention may 
be formulated for any appropriate manner of administration, including for example, 
topical, oral, nasal, intravenous, intracranial, intraperitoneal, subcutaneous or 
intramuscular administration. For parenteral administration, such as subcutaneous 
injection, the carrier preferably comprises water, saline, alcohol, a fat, a wax or a buffer. 
For oral administration, any of the above carriers or a solid carrier, such as mannitol, 
lactose, starch, magnesium stearate, sodium saccharine, talcum, cellulose, glucose, 
sucrose, and magnesium carbonate, may be employed. Biodegradable microspheres 
{e.g., polylactate polyglycolate) may also be employed as carriers for the pharmaceutical 
compositions of this invention. Suitable biodegradable microspheres are disclosed, for 
example, in U.S. Patent Nos. 4,897,268; 5,075,109; 5,928,647; 5,811,128; 5,820,883; 
5,853,763; 5,814,344 and 5,942,252. Modified hepatitis B core protein carrier systems 
are also suitable, such as those described in WO/99 40934, and references cited therein, 
all incorporated herein by reference. One may also employ a carrier comprising the 
particulate-protein complexes described in U.S. Patent No. 5,928,647, which are 
capable of inducing a class I-restricted cytotoxic T lymphocyte responses in a host. 

Such compositions may also comprise buffers (e.g., neutral buffered 
saline or phosphate buffered saline), carbohydrates (e.g., glucose, mannose, sucrose or 
dextrans), mannitol, proteins, polypeptides or amino acids such as glycine, antioxidants, 
bacteriostats, chelating agents such as EDTA or glutathione, adjuvants (e.g., aluminum 
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hydroxide), solutes that render the formulation isotonic, hypotonic or weakly hypertonic 
with the blood of a recipient, suspending agents, thickening agents and/or preservatives. 
Alternatively, compositions of the present invention may be formulated as a 
lyophilizate. Compounds may also be encapsulated within liposomes using well known 
technology. 

Any of a variety of immunostimulants may be employed in the vaccines 
of this invention. For example, an adjuvant may be included. Most adjuvants contain a 
substance designed to protect the antigen from rapid catabolism, such as aluminum 
hydroxide or mineral oil, and a stimulator of immune responses, such as lipid A, 
Bortadella pertussis or Mycobacterium tuberculosis derived proteins. Suitable 
adjuvants are commercially available as, for example, Freund's Incomplete Adjuvant 
and Complete Adjuvant (Difco Laboratories, Detroit, MI); Merck Adjuvant 65 (Merck 
and Company, Inc., Rahway, NJ); AS-2 (SmithKline Beecham, Philadelphia, PA); 
aluminum salts such as aluminum hydroxide gel (alum) or aluminum phosphate; salts of 
calcium, iron or zinc; an insoluble suspension of acylated tyrosine; acylated sugars; 
cationically or anionically derivatized polysaccharides; polyphosphazenes; 
biodegradable microspheres; monophosphoryl lipid A and quil A. Cytokines, such as 
GM-CSF or interleukin-2, -7, or -12, may also be used as adjuvants. 

Within the vaccines provided herein, the adjuvant composition is 
preferably designed to induce an immune response predominantly of the Thl type. 
High levels of Thl-type cytokines (e.g., IFN-y, TNFa, IL-2 and IL-12) tend to favor the 
induction of cell mediated immune responses to an administered antigen. In contrast, 
high levels of Th2-type cytokines (e.g., DL-4, IL-5, JL-6 and IL-10) tend to favor the 
induction of humoral immune responses. Following application of a vaccine as 
provided herein, a patient will support an immune response that includes Thl- and Th2- 
type responses. Within a preferred embodiment, in which a response is predominantly 
Thl-type, the level of Thl-type cytokines will increase to a greater extent than the level 
of Th2-type cytokines. The levels of these cytokines may be readily assessed using 
standard assays. For a review of the families of cytokines, see Mosmann and Coffinan, 
Ann. Rev. Immunol 7:145-173, 1989. 

Preferred adjuvants for use in eliciting a predominantly Thl-type 
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response include, for example, a combination of monophosphoryl lipid A, preferably 3- 
de-O-acylated monophosphoryl lipid A (3D-MPL), together with an aluminum salt. 
MPL adjuvants are available from Corixa Corporation (Seattle, WA; see US Patent Nos. 
4,436,727; 4,877,611; 4,866,034 and 4,912,094). CpG-containing oligonucleotides (in 
which the CpG dinucleotide is unmethylated) also induce a predominantly Thl 
response. Such oligonucleotides are well known and are described, for example, in WO 
96/02555, WO 99/33488 and U.S. Patent Nos. 6,008,200 and 5,856,462. 
Immunostimulatory DNA sequences are also described, for example, by Sato et al., 
Science 275:352, 1996. Another preferred adjuvant is a saponin, preferably QS21 
(Aquila Biophannaceuticals Inc., Framingham, MA), which may be used alone or in 
combination with other adjuvants. For example, an enhanced system involves the 
combination of a monophosphoryl lipid A and saponin derivative, such as the 
combination of QS21 and 3D-MPL as- described in WO 94/00153, or a less reactogenic 
composition where the QS21 is quenched with cholesterol, as described in WO 
96/33739. Other preferred formulations comprise an oil-in-water emulsion and 
tocopherol. A particularly potent adjuvant. formulation involving QS21, 3D-MPL and 
tocopherol in an oil-in-water emulsion is described in WO 95/17210. 

Other preferred adjuvants include Montanide ISA 720 (Seppic, France), 
SAF (Chiron, California, United States), ISCOMS (CSL), MF-59 (Chiron), the SBAS 
series of adjuvants (e.g., SBAS-2 or SBAS-4, available from SmithKline Beecham, 
Rixensart, Belgium), Detox (Corixa, Hamilton, MT), RC-529 (Corixa, Hamilton, MT) 
and other aminoalkyl glucosaminide 4-phosphates (AGPs), such as those described in 
pending U.S. Patent Application Serial Nos. 08/853,826 and 09/074,720, the disclosures 
of which are incorporated herein by reference in their entireties. Other preferred 
adjuvants comprise polyoxyethylene ethers, such as those described in WO 
99/52549A1. 

Any vaccine provided herein may be prepared using well known 
methods that result in a combination of antigen, immune response enhancer and a 
suitable carrier or excipient. The compositions described herein may be administered as 
part of a sustained release formulation {i.e., a formulation such as a capsule, sponge or 
gel (composed of polysaccharides, for example) that effects a slow release of compound 
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following administration). Such formulations may generally be prepared using well 
known technology (see, e.g., Coombes et al., Vaccine 74:1429-1438, 1996) and 
administered by, for example, oral, rectal or subcutaneous implantation, or by 
implantation at the desired target site. Sustained-release formulations may contain a 
polypeptide, polynucleotide or antibody dispersed in a carrier matrix and/or contained 
within a reservoir surrounded by a rate controlling membrane. 

Carriers for use within such formulations are biocompatible, and may 
also be biodegradable; preferably the formulation provides a relatively constant level of 
active component release. Such carriers include microparticles of poly(lactide-co- 
glycolide), polyacrylate, latex, starch, cellulose, dextran and the like. Other delayed- 
release carriers include supramolecular biovectors, which comprise a non-liquid 
hydrophilic core {e.g., a cross-linked polysaccharide or oligosaccharide) and, optionally, 
an external layer comprising an amphiphilic compound, such as a phospholipid {see 
e.g., U.S. Patent No. 5,151,254 and PCT applications WO 94/20078, WO/94/23701 and 
WO 96/06638). The amount of active compound contained within a sustained release 
formulation depends upon the site of implantation, the rate and expected duration of 
release and the nature of the condition to be treated or prevented. 

Any of a variety of delivery vehicles may be employed within 
pharmaceutical compositions and vaccines to facilitate production of an antigen-specific 
immune response that targets HSV-infected cells. Delivery vehicles include antigen 
presenting cells (APCs), such as dendritic cells, macrophages, B cells, monocytes and 
other cells that may be engineered to be efficient APCs. Such cells may, but need not, 
be genetically modified to increase the capacity for presenting the antigen, to improve 
activation and/or maintenance of the T cell response, to have anti-HSV effects per se 
and/or to be immunologically compatible with the receiver {i.e., matched HLA 
haplotype). APCs may generally be isolated from any of a variety of biological fluids 
and organs and may be autologous, allogeneic, syngeneic or xenogeneic cells. 

Certain preferred embodiments of the present invention use dendritic 
cells or progenitors thereof as antigen-presenting cells. Dendritic cells are highly potent 
APCs (Banchereau and Steinman, Nature 5P2:245-251, 1998) and have been shown to 
be effective as a physiological adjuvant for eliciting prophylactic or therapeutic 
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immunity {see Timmerman and Levy, Ann Rev. Med 50:507-529, 1999). In general, 
dendritic cells may be identified based on their typical shape (stellate in situ, with 
marked cytoplasmic processes (dendrites) visible in vitro), their ability to take up, 
process and present antigens with high efficiency and their ability to activate naive T 
cell responses. Dendritic cells may, of course, be engineered to express specific cell- 
surface receptors or ligands that are not commonly found on dendritic cells in vivo or ex 
vivo, and such modified dendritic cells are contemplated by the present invention. As 
an alternative to dendritic cells, secreted vesicles antigen-loaded dendritic cells (called 
exosomes) may be used within a vaccine {see Zitvogel et al., Nature Med 4:594-600, 
1998). 

Dendritic cells and progenitors may be obtained from peripheral blood, 
bone marrow, lymph nodes, spleen, skin, umbilical cord blood or any other suitable 
tissue or fluid. For example, dendritic cells may be differentiated ex vivo by adding a 
combination of cytokines such as GM-CSF, IL-4, IL-13 and/or TNFa to cultures of 
monocytes harvested from peripheral blood. Alternatively, CD34 positive cells 
harvested from peripheral blood, umbilical cord blood or bone marrow may be 
differentiated into dendritic cells by adding to the culture medium combinations of GM- 
CSF, IL-3, TNFa, CD40 ligand, LPS, flt3 ligand and/or other compound(s) that induce 
differentiation, maturation and proliferation of dendritic cells. 

Dendritic cells are conveniently categorized as "immature" and "mature" 
cells, which allows a simple way to discriminate between two well characterized 
phenotypes. However, this nomenclature should not be construed to exclude all 
possible intermediate stages of differentiation. Immature dendritic cells are 
characterized as APC with a high capacity for antigen uptake and processing, which 
correlates with the high expression of Fey receptor and mannose receptor. The mature 
phenotype is typically characterized by a lower expression of these markers, but a high 
expression of cell surface molecules responsible for T cell activation such as class I and 
class II MHC, adhesion molecules {e.g., CD54 and CD11) and costimulatory molecules 
(e.g., CD40, CD80, CD86 and 4-1BB). 

APCs may generally be transfected with a polynucleotide encoding a 
HSV protein (or portion or other variant thereof) such that the HSV polypeptide, or an 
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immunogenic portion thereof, is expressed on the cell surface. Such transfection may 
take place ex vivo, and a composition or vaccine comprising such transfected cells may 
then be used for therapeutic purposes, as described herein. Alternatively, a gene 
delivery vehicle that targets a dendritic or other antigen presenting cell may be 
administered to a patient, resulting in transfection that occurs in vivo. In vivo and ex 
vivo transfection of dendritic cells, for example, may generally be performed using any 
methods known in the art, such as those described in WO 97/24447, or the gene gun 
approach described by Mahvi et al., Immunology and cell Biology 75:456-460, 1997. 
Antigen loading of dendritic cells may be achieved by incubating dendritic cells or 
progenitor cells with the HSV polypeptide, DNA (naked or within a plasmid vector) or 
RNA; or with antigen-expressing recombinant bacterium or viruses {e.g., vaccinia, 
fowlpox, adenovirus or lentivirus vectors). Prior to loading, the polypeptide may be 
covalently conjugated to an immunological partner that provides T cell help (e.g., a 
carrier molecule). Alternatively, a dendritic cell may be pulsed with a non-conjugated 
immunological partner, separately or in the presence of the polypeptide. 

Vaccines and pharmaceutical compositions may be presented in unit- 
dose or multi-dose containers, such as sealed ampoules or vials. Such containers are 
preferably hermetically sealed to preserve sterility of the formulation until use. In 
general, formulations may be stored as suspensions, solutions or emulsions in oily or 
aqueous vehicles. Alternatively, a vaccine or pharmaceutical composition may be 
stored in a freeze-dried condition requiring only the addition of a sterile liquid carrier 
immediately prior to use. 

Immunotherapeutic Applications 

In further aspects of the present invention, the compositions described 
herein may be used for immunotherapy of HSV infections. Within such methods, 
pharmaceutical compositions and vaccines are typically administered to a patient. As 
used herein, a "patient" refers to any warm-blooded animal, preferably a human. The 
above pharmaceutical compositions and vaccines may be used to prophylactically 
prevent or ameliorate the extent of infection by HSV or to treat a patient already 
infected with HSV. Administration may be by any suitable method, including 
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administration by intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal, 
intradermal, anal, vaginal, topical, and oral routes. 

Within certain embodiments, immunotherapy may be active 
immunotherapy, in which treatment relies on the in vivo stimulation of the endogenous 
host immune system to react against HSV infection with the administration of immune 
response-modifying agents (such as polypeptides and polynucleotides as provided 
herein). 

Within other embodiments, immunotherapy may be passive 
immunotherapy, in which treatment involves the delivery of agents with established 
HSV-immune reactivity (such as effectpr cells or antibodies) that can directly or 
indirectly mediate therapeutic effects and does not necessarily depend on an intact host 
immune system. Examples of effector cells include T cells as discussed above, T 
lymphocytes (such as CD8 + cytotoxic T lymphocytes and CD4 + T-helper lymphocytes), 
killer cells (such as Natural Killer cells and lymphokine-activated killer cells), B cells 
and antigen-presenting cells (such as dendritic cells and macrophages) expressing a 
polypeptide provided herein. T cell receptors and antibody receptors specific for the 
polypeptides recited herein may be cloned, expressed and transferred into other vectors 
or effector cells for adoptive immunotherapy. The polypeptides provided herein may 
also be used to generate antibodies or anti-idiotypic antibodies (as described above and 
in U.S. Patent No. 4,918,164) for passive immunotherapy. 

Effector cells may generally be obtained in sufficient quantities for 
adoptive immunotherapy by growth in vitro, as described herein. Culture conditions for 
expanding single antigen-specific effector cells to several billion in number with 
retention of antigen recognition in vivo are well known in the art. Such in vitro culture 
conditions typically use intermittent stimulation with antigen, often in the presence of 
cytokines (such as BL-2) and non-dividing feeder cells. As noted above, 
immunoreactive polypeptides as provided herein may be used to rapidly expand 
antigen-specific T cell cultures in order to generate a sufficient number of cells for 
immunotherapy. In particular, antigen-presenting cells, such as dendritic, macrophage, 
monocyte, fibroblast and/or B cells, may be pulsed with immunoreactive polypeptides 
or transfected with one or more polynucleotides using standard techniques well known 
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in the art. For example, antigen-presenting cells can be transfected with a 
polynucleotide having a promoter appropriate for increasing expression in a 
recombinant virus or other expression system. Cultured effector cells for use in therapy 
must be able to grow and distribute widely, and to survive long term in vivo. Studies 
have shown that cultured effector cells can be induced to grow in vivo and to survive 
long term in substantial numbers by repeated stimulation with antigen supplemented 
with IL-2 (see, for example, Cheever et al. 5 Immunological Reviews 757:177, 1997). 

Alternatively, a vector expressing a polypeptide recited herein may be 
introduced into antigen presenting cells taken from a patient and clonally propagated ex 
vivo for transplant back into the same patient. Transfected cells may be reintroduced 
into the patient using any means known in the art, preferably in sterile form by 
intravenous, intracavitary or intraperitoneal. 

Routes and frequency of administration of the therapeutic compositions 
described herein, as well as dosage, will vary from individual to individual, but may be 
readily established using standard techniques. In one embodiment, between 1 and about- 
10 doses may be administered over a 52 week period. In another embodiment, about 6 
doses are administered, at intervals of about 1 month, and booster vaccinations are 
typically be given periodically thereafter. Alternate protocols may be appropriate for 
individual patients. 

A suitable dose is an amount of a compound that, when administered as 
described above, is capable of promoting an anti-HSV immune response, and is 
preferably at least 10-50% above the basal (i.e., untreated) level. Such response can be 
monitored, for example, by measuring the anti-HSV antibodies in a patient. Such 
vaccines should also be capable of causing an immune response that leads to an 
improved clinical outcome (e.g., more frequent remissions, complete or partial or longer 
disease-free survival) in vaccinated patients as compared to non-vaccinated patients. In 
general, for pharmaceutical compositions and vaccines comprising one or more 
polypeptides, the amount of each polypeptide present in a dose ranges from about 25 |ig 
to 5 mg per kg of host. Suitable dose sizes will vary with the size of the patient, but will 
typically range from about 0.1 mL to about 5 mL. 

In general, an appropriate dosage and treatment regimen provides the 
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active compound(s) in an amount sufficient to provide therapeutic and/or prophylactic 
benefit. Such a response can be monitored by establishing an improved clinical 
outcome (e.g., more frequent remissions, complete or partial, or longer disease-free 
survival) in treated patients as compared to non-treated patients. Increases in 
preexisting immune responses to a HS V protein may correlate with an improved clinical 
outcome. Such immune responses may generally be evaluated using standard 
proliferation, cytotoxicity or cytokine assays, which may be performed using samples 
obtained from a patient before and after treatment. 

HSV Detection and Diagnosis 

In general, HSV may be detected in a patient based on the presence of 
one or more HSV proteins and/or polynucleotides encoding such proteins in a biological 
sample (for example, blood, sera, sputum urine and/or other appropriate tissue) obtained 
from the patient. In other words, such proteins may be used as markers to indicate the 
presence or absence of HSV in a patient. The binding agents provided herein generally 
permit detection of the level of antigen that binds to the agent in the biological sample. 
Polynucleotide primers and probes may be used to detect the level of mRNA encoding a 
HSV protein, which is also indicative of the presence or absence of HSV infection. 

There are a variety of assay formats known to those of ordinary skill in 
the art for using a binding agent to detect polypeptide markers in a sample. See, e.g., 
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 
1988. In general, the presence or absence of HSV in a patient may be determined by 
contacting a biological sample obtained from a patient with a binding agent and 
detecting in the sample a level of polypeptide that binds to the binding agent. 

In a preferred embodiment, the assay involves the use of binding agent 
immobilized on a solid support to bind to and remove the polypeptide from the 
remainder of the sample. The bound polypeptide may then be detected using a detection 
reagent that contains a reporter group and specifically binds to the binding 
agent/polypeptide complex. Such detection reagents may comprise, for example, a 
binding agent that specifically binds to the polypeptide or an antibody or other agent 
that specifically binds to the binding agent, such as an antiimmunoglobulin, protein G, 
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protein A or a lectin. Alternatively, a competitive assay may be utilized, in which a 
polypeptide is labeled with a reporter group and allowed to bind to the immobilized 
binding agent after incubation of the binding agent with the sample. The extent to 
which components of the sample inhibit the binding of the labeled polypeptide to the 
binding agent is indicative of the reactivity of the sample with the immobilized binding 
agent. Suitable polypeptides for use within such assays include full length HSV 
proteins and portions thereof to which the binding agent binds, as described above. 

The solid support may be any material known to those of ordinary skill 
in the art to which the protein may be attached. For example, the solid support may be a 
test well in a microtiter plate or a nitrocellulose or other suitable membrane. 
Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a 
plastic material such as polystyrene or polyvinylchloride. The support may also be a 
magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. 
Patent No. 5,359,681. The binding agent may be immobilized on the solid support 
using a variety of techniques known to those of skill in the art, which are amply 
described in the patent and scientific literature. In the context of the present invention, 
the term "immobilization" refers to both noncovalent association, such as adsorption, 
and covalent attachment (which may be a direct linkage between the agent and 
functional groups on the support or may be a linkage by way of a cross-linking agent). 
Immobilization by adsorption to a well in a microtiter plate or to a membrane is 
preferred. In such cases, adsorption may be achieved by contacting the binding agent, in 
a suitable buffer, with the solid support for a suitable amount of time. The contact time 
varies with temperature, but is typically between about 1 hour and about 1 day. In 
general, contacting a well of a plastic microtiter plate (such as polystyrene or 
polyvinylchloride) with an amount of binding agent ranging from about 10 ng to about 
10 jig, and preferably about 100 ng to about 1 ng, is sufficient to immobilize an 
adequate amount of binding agent. 

Covalent attachment of binding agent to a solid support may generally be 
achieved by first reacting the support with a Afunctional reagent that will react with 
both the support and a functional group, such as a hydroxyl or amino group, on the 
binding agent. For example, the binding agent may be covalently attached to supports 
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having an appropriate polymer coating using benzoquinone or by condensation of an 
aldehyde group on the support with an amine and an active hydrogen on the binding 
partner (see, e.g., Pierce Immunotechnology Catalog and Handbook, 1991, at 
A12-A13). 

In certain embodiments, the assay is a two-antibody sandwich assay. 
This assay may be performed by first contacting an antibody that has been immobilized 
on a solid support, commonly the well of a microtiter plate, with the sample, such that 
polypeptides within the sample are allowed to bind to the immobilized antibody. 
Unbound sample is then removed from the immobilized polypeptide-antibody 
complexes and a detection reagent (preferably a second antibody capable of binding to a 
different site on the polypeptide) containing a reporter group is added. The amount of 
detection reagent that remains bound to the solid support is then determined using a 
method appropriate for the specific reporter group. 

More specifically, once the antibody is immobilized on the support as 
described above, the remaining protein binding sites on the support are typically 
blocked. Any suitable blocking agent known to those of ordinary skill in the art, such as 
bovine serum albumin or Tween 20™ (Sigma Chemical Co., St. Louis, MO). The 
immobilized antibody is then incubated with the sample, and polypeptide is allowed to 
bind to the antibody. The sample may be diluted with a suitable diluent, such as 
phosphate-buffered saline (PBS) prior to incubation. In general, an appropriate contact 
time (i.e., incubation time) is a period of time that is sufficient to detect the presence of 
polypeptide within a sample obtained from an individual with an HSV infection. 
Preferably, the contact time is sufficient to achieve a level of binding that is at least 
about 95% of that achieved at equilibrium between bound and unbound polypeptide. 
Those of ordinary skill in the art will recognize that the time necessary to achieve 
equilibrium may be readily determined by assaying the level of binding that occurs over 
a period of time. At room temperature, an incubation time of about 30 minutes is 
generally sufficient. 

Unbound sample may then be removed by washing the solid support 
with an appropriate buffer, such as PBS containing 0.1% Tween 20™. The second 
antibody, which contains a reporter group, may then be added to the solid support. 
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Preferred reporter groups include those groups recited above. 

' The detection reagent is then incubated with the immobilized antibody- 

polypeptide complex for an amount of time sufficient to detect the bound polypeptide. 
An appropriate amount of time may generally be determined by assaying the level of 
binding that occurs over a period of time. Unbound detection reagent is then removed 
and bound detection reagent is detected using the reporter group. The method employed 
for detecting the reporter group depends upon the nature of the reporter group. For 
radioactive groups, scintillation counting or autoradiographic methods are generally 
appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups 
and fluorescent groups. Biotin may be detected using avidin, coupled to a different 
reporter group (commonly a radioactive or fluorescent group or an enzyme). Enzyme 
reporter groups may generally be detected by the addition of substrate (generally for a 
specific period of time), followed by spectroscopic or other analysis of the reaction 
products. 

To determine the presence or absence of HSV, the signal detected from 
the reporter group that remains bound to the solid support is generally compared to a 
signal that corresponds to a predetermined cut-off value. In one embodiment, the cut- 
off value for the detection of HSV is the average mean signal obtained when the 
immobilized antibody is incubated with samples from patients without HSV. In an 
alternate embodiment, the cut-off value is determined using a Receiver Operator Curve, 
according to the method of Sackett et al., Clinical Epidemiology: A Basic Science for 
Clinical Medicine, Little Brown and Co., 1985, p. 106-7. Briefly, in this embodiment, 
the cut-off value may be determined from a plot of pairs of true positive rates (f.e., 
sensitivity) and false positive rates (100%-specificity) that correspond to each possible 
cut-off value for the diagnostic test result. The cut-off value on the plot that is the 
closest to the upper left-hand corner (i.e. 9 the value that encloses the largest area) is the 
most accurate cut-off value, and a sample generating a signal that is higher than the cut- 
off value determined by this method may be considered positive. Alternatively, the cut- 
off value may be shifted to the left along the plot, to minimize the false positive rate, or 
to the right, to minimize the false negative rate. In general, a sample generating a signal 
that is higher than the cut-off value determined by this method is considered positive. 
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In a related embodiment, the assay is performed in a flow-through or 
strip test format, wherein the binding agent is immobilized on a membrane, such as 
nitrocellulose. In the flow-through test, polypeptides within the sample bind to the 
immobilized binding agent as the sample passes through the membrane. A second, 
labeled binding agent then binds to the binding agent-polypeptide complex as a solution 
containing the second binding agent flows through the membrane. The detection of 
bound second binding agent may then be performed as described above. In the strip test 
format, one end of the membrane to which binding agent is bound is immersed in a 
solution containing the sample. The sample migrates along the membrane through a 
region containing second binding agent and to the area of immobilized binding agent. 
Concentration of second binding agent at the area of immobilized antibody indicates the 
presence of HSV. Typically, the concentration of second binding agent at that site 
generates a pattern, such as a line, that can be read visually. The absence of such a 
pattern indicates a negative result. In general, the amount of binding agent immobilized 
on the membrane is selected to generate a visually discernible pattern when the 
biological sample contains a level of polypeptide that would be sufficient to generate a 
positive signal in the two-antibody sandwich assay, in the format discussed above. 
Preferred binding agents for use in such assays are antibodies and antigen-binding 
fragments thereof. Preferably, the amount of antibody immobilized on the membrane 
ranges from about 25 ng to about ljag, and more preferably from about 50 ng to about 
500 ng. Such tests can typically be performed with a very small amount of biological 
sample. 

Of course, numerous other assay protocols exist that are suitable for use 
with the HSV proteins or binding agents of the present invention. The above 
descriptions are intended to be exemplary only. For example, it will be apparent to 
those of ordinary skill in the art that the above protocols may be readily modified to use 
HSV polypeptides to detect antibodies that bind to such polypeptides in a biological 
sample. The detection of such protein-specific antibodies can allow for the 
identification of HSV infection. 

HSV infection may also, or alternatively, be detected based on the 
presence of T cells that specifically react with a HSV protein in a biological sample. 
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Within certain methods, a biological sample comprising CD4 + and/or CD8 + T cells 
isolated from a patient is incubated with a HSV polypeptide, a polynucleotide encoding 
such a polypeptide and/or an APC that expresses at least an immunogenic portion of 
such a polypeptide, and the presence or absence of specific activation of the T cells is 
detected. Suitable biological samples include, but are not limited to, isolated T cells. 
For example, T cells may be isolated from a patient by routine techniques (such as by 
Ficoll/Hypaque density gradient centrifiigation of peripheral blood lymphocytes). T 
cells may be incubated in vitro for about 2-9 days (typically about 4 days) at 37°C with 
polypeptide (e.g., 5-25 ng/ml). It may be desirable to incubate another aliquot of a 
T cell sample in the absence of HSV polypeptide to serve as a control For CD4 + T 
cells, activation is preferably detected by evaluating proliferation of the T cells. For 
CD8* T cells, activation is preferably detected by evaluating cytolytic activity. A level 
of proliferation that is at least two fold greater and/or a level of cytolytic activity that is 
at least 20% greater than in disease-free patients indicates the presence of HSV in the 
patient. 

As noted above, HSV infection may also, or alternatively, be detected 
based on the level of mRNA encoding a HSV protein in a biological sample. For 
example, at least two oligonucleotide primers may be employed in a polymerase chain 
reaction (PCR) based assay to amplify a portion of a HSV cDNA derived from a 
biological sample, wherein at least one of the oligonucleotide primers is specific for 
(i.e., hybridizes to) a polynucleotide encoding the HSV protein. The amplified cDNA is 
then separated and detected using techniques well known in the art, such as gel 
electrophoresis. Similarly, oligonucleotide probes that specifically hybridize to a 
polynucleotide encoding a HSV protein may be used in a hybridization assay to detect 
the presence of polynucleotide encoding the HSV protein in a biological sample. 

To permit hybridization under assay conditions, oligonucleotide primers 
and probes should comprise an oligonucleotide sequence that has at least about 60%, 
preferably at least about 75% and more preferably at least about 90%, identity to a 
portion of a polynucleotide encoding a HSV protein that is at least 10 nucleotides, and 
preferably at least 20 nucleotides, in length. Preferably, oligonucleotide primers and/or 
probes hybridize to a polynucleotide encoding a polypeptide described herein under 
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moderately stringent conditions, as defined above. Oligonucleotide primers and/or 
probes which may be usefully employed in the diagnostic methods described herein 
preferably are at least 10-40 nucleotides in length. In a preferred embodiment, the 
oligonucleotide primers comprise at least 10 contiguous nucleotides, more preferably at 
least 15 contiguous nucleotides, of a DNA molecule having a sequence disclosed 
herein. Techniques for both PCR based assays and hybridization assays are well known 
in the art (see, for example, Mullis et al., Cold Spring Harbor Symp. Quant Biol., 
57:263, 1987; Erlich ed., PCR Technology, Stockton Press, NY, 1989). 

One preferred assay employs RT-PCR, in which PCR is applied in 
conjunction with reverse transcription. Typically, RNA is extracted from a biological 
sample, such as biopsy tissue, and is reverse transcribed to produce cDNA molecules. 
PCR amplification using at least one specific primer generates a cDNA molecule, which 
may be separated and visualized using, for example, gel electrophoresis. Amplification 
may be performed on biological samples taken from a test patient and from an 
individual who is not infected with HSV. The amplification reaction may be performed 
on several dilutions of cDNA, for example spanning two orders of magnitude. 

As noted above, to improve sensitivity, multiple HSV protein markers 
may be assayed within a given sample. It will be apparent that binding agents specific 
for different HSV polypeptides may be combined within a single assay. Further, 
multiple primers or probes may be used concurrently. The selection of HSV protein 
markers may be based on routine experiments to determine combinations that results in 
optimal sensitivity. In addition, or alternatively, assays for HSV proteins provided 
herein may be combined with assays for other known HSV antigens. 

The present invention further provides kits for use within any of the 
above diagnostic and/or therapeutic methods. Such kits typically comprise two or more 
components necessary for performing a diagnostic and/or therapeutic assay and will 
further comprise instructions for the use of said kit. Components may be compounds, 
reagents, containers and/or equipment. For example, one container within a diagnostic 
kit may contain a monoclonal antibody or fragment thereof that specifically binds to a 
HSV protein. Such antibodies or fragments may be provided attached to a support 
material, as described above. One or more additional containers may enclose elements, 
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such as reagents or buffers, to be used in the assay. Such kits may also, or alternatively, 
contain a detection reagent as described above that contains a reporter group suitable for 
direct or indirect detection of antibody binding. 

Alternatively, a kit may be designed to detect the level of mKNA 
encoding a HSV protein in a biological sample. Such kits generally comprise at least 
one oligonucleotide probe or primer, as described above, that hybridizes to a 
polynucleotide encoding a HSV protein. Such an oligonucleotide may be used, for 
example, within a PCR or hybridization assay. Additional components that may be 
present within such kits include a second oligonucleotide and/or a diagnostic reagent or 
container to facilitate the detection of a polynucleotide encoding a HSV protein. 

The following Examples are offered by way of illustration and not by 
way of limitation. 
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Example 1 
Identification of HSV-2 Antigens 

The following examples are presented to illustrate certain embodiments 
of the present invention and to assist one of ordinary skill in making and using the same. 
The examples are not intended in any way to otherwise limit the scope of the invention. 

Source of HSV-2 positive donors : Lymphocytes were obtained from two 
types of donors: Group A) seropositive donors with unknown clinical status, and Group 
B) seropositive donors with well characterized clinical status (viral shedding and ano- 
genital lesion recurrences). 

Group A: Blood samples (50 ml) were obtained from 13 potential 
donors. No infoixaation regarding clinical history of HSV-2 infection was requested. 
The blood was screened for serum antibody against HSV-1 and HSV-2 by Western blot. 
PBMCs were also screened for specific proliferative T cell responses to HSV-1 and 
HSV-2 lysate antigens (ABI; Columbia, MD). Three donors (AD104, AD116, and 
AD120) were positive for HSV-2 serum antibody and their PBMCs specifically 
proliferated in response to HSV-2 antigen. Leukopheresis PBMC were collected from 
these donors and cryopreserved in liquid nitrogen. 

Group B: Ano-genital lesion biopisies were obtained from donors 
DK21318 and JR5032. Lesion biopsy lymphocytes were expanded in vitro with DL-2 
and PHA in the presence of 50 uM acyclovir and subsequently cryopreserved in liquid 
nitrogen. Typically 5 x 10 6 to 5 x 10 7 lymphocytes are obtained after two weeks. 
Autologous PBMC were also collected from the blood of DK2318 and JR5032 and 
cryopreserved in liquid nitrogen. 

Generation of CD4+ T cell lines : Cryopreserved PBMCs or lesion- 
biopsy lymphocytes were thawed and stimulated in vitro with 1 ug/ml HSV-2 antigen 
(ABI) in RPMI 1640 + 10% human serum + 10 ng/ml IL-7. Irradiated autologous PBMC 
were added as antigen presenting cells for the lesion biopsy lymphocytes only. 
Recombinant IL-2 (1 ng/ml) was added on days 1 and 4. The cells were harvested, 
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washed, and replated in fresh medium containing IL-2 and IL-7 on day 7. Recombinant 
LL-2 was again added on day 10. The T cells were harvested, washed, and restimulated in 
vitro with HSV-2 antigen plus irradiated autologous PBMCin the same manner on day 
14 of culture. The T cell lines were cryopreserved at 1 x 10 7 cells/vial in liquid nitrogen 
on day 11-12 of the secondary stimulation. After thawing, the cryopreserved T cells 
retained the ability to specifically proliferate to HSV-2 antigen in vitro. These T cells 
were subsequently used to screen HSV-2 gene-fragment expression cloning libraries 
prepared in E. coli, as described below. 

Preparation of HSV-2 (333) DNA : HSV-2 strain 333 virus was grown in- 
Vero cells cultured in roller bottles in 200 ml/bottle of Medium 199 (Gibco) + 5% FCS. 
Vero cells are transformed African green monkey fibroblast-like cells that were 
obtained from ATCC (Cat. # CCL-81). Near-confluence Vero cells (10 roller bottles) 
were infected with HSV-2 strain 333 virus at an MOI of 0.01 in 50 ml/bottle of Medium 
199 + 1% FCS. Cells and medium were harvested from the roller bottles and the cells 
pelleted. The supernatant was saved on ice and the cell pellets were resuspended in 
fresh Medium 199+1% FCS and lysed by 6 cycles of freezing/thawing. The cell debris 
in the lysates was pelleted and the supernatant pooled with the saved culture 
supernatant. Virus was pelleted from the pooled supernatants by ultracentrifugation 
(12,000g, 2 hours, 4°C) and resuspended in 2 ml of fresh Medium 199 + 1% FCS. The 
virus was further purified on a 5 - 15% linear Ficoll gradient by ultracentrifugation 
(19,000 g, 2 hours, 4°C) as previously described (Chapter 10:Herpes simplex virus 
vectors of Molecular Virology: A Practical Approach (1993); Authors: F.J. Rixon and J. 
McClaughlan, Editors: A.J. Davison and R.M. Elliott; Publisher: Oxford University 
Press, Inc, New York, N. Y.). The HSV-2 virus-containing band was extracted from the 
gradient, diluted 10-fold with Medium 199, and the virus pelleted by ultracentrifugation 
at 19,000 g for 4 hours at 4°C. The virus pellet was recovered and resuspended in 10 ml 
of Tris/EDTA (TE) buffer. Intact virions were treated with DNAse and RNAse to 
remove cellular DNA and RNA. The enzymes were then inactivated by addition of 
EDTA and incubation at 65°C. DNA was prepared from the gradient-purified virus by 
lysis of the viral particles with SDS in the presence of EDTA, followed by 
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phenol/chlorform extraction to purify the genomic viral DNA. HSV-2 DNA was 
precipitated with EtOH and the DNA pellet was dried and resuspended in 1 ml of 
Tris/EDTA buffer. The concentration and purity of the DNA was determined by reading 
the OD 260 and OD 280 on a UV spectrophotometer. Genomic DNA prepared in this 
manner was used for construction of an HSV-2 genomic fragment expression library in 
E. coli. 

Construction of HSV-2 DNA fragment libraries in the pET17b vector : 
The HSV2-I library was constructed as follows. DNA fragments were generated by 
sonicating genomic HSV-2 DNA for 4 seconds at 15% output with a Fisher "60 
SonicDismembrator" (Fisher). The sonicated DNA was then precipitated, pelleted, and 
resuspended in 1 1 uL TE buffer. The approximate size of the DNA fragments was 
measured by agarose gel electropheresis of 1 uL of the fragmented HSV-2 genomic 
DNA vs. 1.5 ug unsonicated material. The average size of the DNA fragments was 
determined to be approx. 500 bp when visualized after ethidium bromide staining of the 
gel. Incomplete DNA fragment ends were filled in (blunted) usingT4 DNA polymerase. 
EcoRl adapters were then ligated to the blunt ends of the DNA fragments using T4 
DNA ligase. The DNA was then kinased using T4 Polynucleotide Kinase, purified using 
a manually loaded column of S-400-HR Sephacryl (Sigma) and ligated into the pET17b 
expression vector. The HSV2-II library was constructed in a similar fashion. The 
average size of inserts in this library was determined to be approximately lOOObp. 

Generation of the HSV-2 fragment expression library in E. coli. The 
HSV2-I library was transformed into E. coli for preparation of glycerol stocks and 
testing of HSV-2 DNA insert representation. The DNA was transformed into 
ElectroMAX DH10B E. coli (Gibco) in order to prepare a large quantity of HSV- 
2/pET17b library DNA. Transformed bacteria were grown up on 3 LB/Ampicillin plates 
(approx. 750 CFU/plate), a small subset of colonies were picked for sequencing of DNA 
inserts, and the remaining bacteria from each plate collected as a pool for preparation of 
plasrnid DNA. These pools were named HSV-2 Pools 9, 10 and 11. Glycerol stocks of a 
portion of these bacterial pools were stored at -80° C. Plasmids were purified from the 
remainder of the pools. Equal quantities of plasrnid DNA from each of the 3 pools was 
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combined to make a single pool of plasmid DNA. The tranformation efficiency of the 
pooled DNA was empirically determined using JM109(DE3) E. coli bacteria. 
JM109(DE3) bacteria were then transformed with an amount of the final pool of library 
DNA that was expected to yield 15 colony-forming units (CFU) per plate. The 
transformed bacteria were then plated on 100 LB/amp plates. Twenty CFU (on average) 
were actually observed on each of the 100 plates; therefore the pool size of this HSV-2 
library was about 20 clones/pool. The bacterial colonies were collected as a pool from 
each plate in approximately 800 ul/plate of LB + 20% glycerol. Each pool was 
distributed equally (200 ul/well) among four 96-well U-bottom plates and these "master 
stock" plates were stored at -80°C. The size of this HSV-2 gene-fragment library 
(hereafter referred to as HSV2I) was therefore 96 pools of 20 clones/pool. Plasmid 
DNA was prepared from 20 randomly picked colonies and the inserts sequenced. 
Approximately 15% (3/20) contained HSV-2 DNA as insert, 80% (16/20) contained 
non-HSV-2 DNA (E. coli or Vero cell DNA), and 5% (1/20) contained no insert DNA. 
The HSV2-II DNA library was transformed into E. coli and random colonies analyzed 
in a similar manner. Relevant differences in the construction of library HSV2-II 
included the transformation of the HSV-2/pET17b ligation product into NovaBlue 
(Novagen) chemically competent E. coli instead of using electroporation for preparation 
of a larger quantity of plasmid for pooling and transformation into JM109(DE3) 
bacteria for empirical evaluation. Additionally, plasmid DNA was prepared from 10 
pools averaging 160 colonies/plate. These 10 plasmid pools were combined in an 
equivalent fashion (normalized based on spectrophotometer readings) into one pool for 
transformation into JM109(DE3) as per previously, yielding an average of 20 
colonies(clones)/plate for harvesting into glycerol stock pools as before. Approximately 
25% contained HSV-2 DNA as insert, with the remaining 75% containing E. coli DNA 
as insert. 

Induction of the HSV-2 fragment expression library for screening with 
human CD4+ T cells . One of the master HSV2I library 96-well plates was thawed at 
room temperature. An aliquot (20 uL) was transferred from each well to a new 96 well 
plate containing 180 uL/well of LB medium + ampicillin. The bacteria were grown up 
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overnight and then 40 ul transferred into two new 96-well plates containing 160 uL 
2xYT medium + ampicillin. The bacteria were grown for 1 hr.15 min at 37°C. Protein 
expression was then induced by addition of IPTG to 200 mM. The bacteria were 
cultured for an additional 3 hrs. One of these plates was used to obtain 
spectrophotometer readings to normalize bacterial numbers/well. The second, 
normalized plate was used for screening with CD4+ T cells after pelleting the bacteria 
(approx. 2 x 10 7 /well) and removing the supernatants. The HSV2-H library was grown 
and induced in a similar fashion. 

Preparation of autologous dendritic APC's : Dendritic cells (DCs) were 
generated by culture of plastic-adherent donor cells (derived from 1 x 10 8 PBMC) in 6 
well plates (Costar 3506) in RPMI 1640 + 10% of a 1:1 mix of FCS:HS + 10 ng/ml 
GM-CSF + 1 0 ng/ml DL-4 at 37°C. Non-adherent DCs were collected from plates on day 
6 of culture and irradiated with 3300 Rads. The DCs were then plated at 1 x 10 4 /well in 
flat-bottom 96-well plates (Costar 3596) and cultured overnight at 37°C. The following 
day, the DCs were pulsed with the induced HSV2-I or HSV2-II library pools by 
resuspending the bacterial pellets in 200 ul RPMI 1640+ 10 %FCS without antibiotics 
and transferring 10 ul/well to the wells containing the DCs in 190ul of the same 
medium without antibiotics. The DCs and bacteria were co-cultured for 90 minutes at 
37°C. The DCs were then washed and resuspended in 100 ul/well RPMI 1640 + 10% 
HS + L-glut. + 50 ug/ml gentamicin antibiotic. 

Preparation of responder T cells : Cryopreserved CD4+ T cell lines were 
thawed 5 days before use and cultured at 37°C in RPMI 1640 + 10% HS + 1 ng/ml DL-2 
+ 10 ng/ml BL-7. After 2 days, the medium was replaced with fresh medium without DLr 
2andIL-7. 

Primary screening of the HSV2 libraries : The T cells were resuspended 
in fresh RPMI 1640 + 10% HS and added at 2 x 10 4 /well to the plates containing the E. 
coli-pulsed autologous DCs. After 3 days, 100 ul/well of supernatant was removed and 
transferred to new 96 well plates. Half of the supernatant was subsequently tested for 
IFN-gamma content by ELISA and the remainder was stored at -20°C. The T cells were 
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then pulsed with 1 uCi/well of [3H]-Thymidine (Amersham/Pharmacia; Piscataway, 
NJ) for about 8 hours at 37°C. The 3H-pulsed cells were then harvested onto UniFilter 
GF/C plates (Packard; Downers Grove, IL) and the CPM of [3H]-incorporated 
subsequently measured using a scintillation counter (Top-Count; Packard). ELISA 
assays were performed on cell supernatants following a standard cytokine-capture 
ELISA protocol for human IFN-g. 

From the HSV2-I library screening with T cells from D104, wells 
HSV2I_H10 and HSV2IJH12, for which both CPM and IFN-g levels were significantly 
above background, were scored as positive. 

Breakdown of positive HSV2I library pools : The positive wells 
(HSV2IJH10 and HSV2I_H12) from the initial CD4+ T cell screening experiment were 
grown up again from the master glycerol stock plate. Forty-eight sub-clones from each 
pool were randomly picked, grown up and BPTG-induced as described previously. The 
subclones were screened against the AD 104 CD4+ T cell line as described above. A 
clone (HSV2IJH12A12) from the HSV2I_H12 pool breakdown scored positive. This 
positive result was verified in a second AD 104 CD4+ T cell assay. 

Identification of UL39 as a CD4+ T cell antigen : The positive clone 
(HSV2I_H12A12) was subcloned and 10 clones picked for restriction digest analysis 
with EcoRI NB#675 pg. 34. All 10 clones contained DNA insert of the same size 
(approximately 900 bp in length). Three of these clones (HSV2IJH12A12J, 7, and 8) 
were chosen for sequencing and all contained identical insert sequences at both the 5' 
and V ends of the inserts. The DNA sequence of the insert is set forth in SEQ ID NO:l, 
and contains an open reading frame set forth in SEQ ID NO:2. The insert sequence was 
compared to the complete genomic sequence of HSV-2 strain HG52 (NCBI site, 
Accession #Z86099) and the sequence was determined have a high degree of homology 
to UL39 (a.k.a. ICP6), the large subunit (140 kD) of the HSV ribonucleotide reductase, 
the sequence of which is set forth in SEQ ID NO:3. The insert sequence set forth in 
SEQ ID NO: 1 spans nucleotides 876 - 1690 of the UL39 open reading frame (3,432 
bp) and encodes the amino acid sequence set forth in SEQ ID NO:2, which has a high 
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degree of homology to amino acids 292 - 563 of UL39 (full length = 1 143 aa). 

Identification of US8A. US3/US4, UL15. UL18, UL27 and UL46 as 
CD4+ T cell antigens : In a manner essentially identical to that described above for the 
identification of UL39 as a T cell antigen, an additional HSV2 gene fragment 
expression cloning library, referred to as HSV2-II, was prepared, expressed in E. coli, 
and screened with donor T cells. 

Screening the HSV2-II library with T cells from donor AD1 16 identified 
the clone HSV2n_US8AfragD6.B_Bll_T7Trc.seq, determined to have an insert 
sequence set forth in SEQ ID NO:4, encoding open reading frames having amino acid 
sequences set forth in SEQ ID NO:5 and 6, with the sequence of SEQ ID NO:5 having a 
high degree of homology with the HSV2 US8A protein, the sequence of which is set 
forth in SEQ ID NO:7. 

In addition, screening the HSV2-II library with T cells from donor 
AD 104 identified the following clone inserts: 

SEQ ID NO:8, corresponding to clone HSV2IIJJS3/US4 
fragF10B3_T7Trc.seq, containing a potential open reading frame having an amino acid 
sequence set forth in SEQ ID NO: 10; 

SEQ ID NO:9, corresponding to clone HSV2IMJS3/US4 
fragF10B3_T7P.seq, containing an open reading frame having an amino acid sequence 
set forth in SEQ ID NO: 1 1, sharing a high degree of homology with the HSV-2 US3 
protein (SEQ ID NO: 12); 

SEQ ID NO: 13, corresponding to clone 
HSV2njJL46fragFllF5JT7Trc.seq, containing an open reading frame having an 
amino acid sequence set forth in SEQ ID NO: 14, sharing a high degree of homology 
with the HSV-2 UL46 protein (SEQ ID NO: 15); 



SEQ ID NO: 16, corresponding to clone HSV2n_UL27frag- 
H2C7_T7Trc.seq, containing an open reading frame having an amino acid sequence set 
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forth in SEQ ID NO:17, sharing a high degree of homology with the HSV-2 UL27 
protein (SEQ ID NO: 1 8); 

SEQ ID NO: 19, corresponding to clone 
HSV2njUL18fragF10Al_rc.seq, containing open reading frames having amino acid 
sequences set forth in SEQ ID NO:20, 21 and 22, with SEQ ID NO:22 sharing a high 
degree of homology with the HSV-2 UL1 8 protein (SEQ ID NO: 23); and 

SEQ ID NO:24, corresponding to clone 
HSV2II_UL15fragF10A12_rc.seq, containing an open reading frame having an amino 
acid sequence set forth in SEQ ID NO: 25, sharing a high degree of homology with the 
HSV-2 UL15 protein (SEQ ID NO: 26), 

EXAMPLE 2 
Identification of HSV-2 Antigens 
CD4 + T cells from AD104 were found to recognize inserts from clones 
HSV2n_UL46fragFllF5_T7Trc.seq (SEQ ID NO: 13) and 
HSV2njJL18frgaF10Aljx.seq (SEQ ID NO: 19) as described in detail in Example 1. 
The sequences from these clones share a high degree of homology to the HSV2-I genes, 
UL46 (SEQ IE) NO: 15) and UL18 (SEQ ID NO:23), respectively. Therefore to further 
characterize the epitopes recognized by these T cells, overlapping 15-mer peptides were 
made across the clone insert fragments of UL18 and UL46. Peptide recognition by 
AD104's CD4+ T cells was tested in a 48 hour IFN-g ELISPOT assay. ELISPOTS 
were performed by adding IxlO 4 autologous EBV-transformed B cells (LCL) or DCs 
per well in 96 well ELISPOT plates. 2xl0 4 AD104 CD4+ T cells from AD104's line 
were added per well with 5\ig/m\ of the HSV2 peptides. AD 104 CD4+ T cells 
recognized peptides 20 and 21 (SEQ ID NO: 32 and 33) of UL18, and peptides 1, 4, 9, 
10, and 20 of UL46 (SEQ ID NO: 27-31). 



WO 02/02131 



PCT7US01/20981 



105 

EXAMPLE 3 
Identification of HSV-2 Antigens 

CD4+ T cell lines were generated from DK2318 and JR5032 lesion- 
biopsy. The CD4+ lymphocytes were stimulated twice in vitro on irradiated autologous 
PBMC and HSV2 antigen as described in example 1. The lines were tested for their 
antigen specificity as described in example 1 and cryopreserved. The CD4+ T cell lines 
were screened against the HSV2-II expression-cloning library generated in Example 1. 

DK2318 was shown to react with clones C12 and G10. Clone CI 2 was 
determined to have an insert sequence set forth in SEQ ID NO:36. This insert was 
found to have sequence homology with fragments of 2 HSV-II genes, nucleotides 723- 
1311 of UL23 and nucleotides 1-852 of UL22. These sequences correspond to amino 
acids 241-376 of UL23 as set forth in SEQ ID NO:40 and amino acids 1-284 as set forth 
in SEQ ID NO:4L The DNA sequence of SEQ ID NO:36 was searched against public 
databases including Genbank and shown to have a high degree of sequence homology to 
the HSV2 genes UL23 and UL22 set forth in SEQ ID NO:37 and 38 respectively. The 
protein sequences encoded by SEQ ID NO:37 and 38 are set forth in SEQ ID NO:39 
and 45. Clone G10 was determined to have an insert sequence which is set forth in 
SEQ ID NO:48, encoding open reading frames having an amino acid sequence set forth 
in SEQ ID NO:50, with the sequence of SEQ ID NO:48 having a high degree of 
sequence homology with HSV2 UL37, the sequence of which is set forth in SEQ ID 
NO:49, encoding open reading frames having the amino acid sequences set forth in 
SEQ ID NO:5L DK2318's CD4+ T cell line was screened against overlapping 15 mers 
covering the UL23 protein. DK2318's CD4 line was shown to react against three UL23 
specific peptides (SEQ ID NO:41-43) suggesting that UL23 is a target. 

The CD4+ T cell line generated from JR5032 was found to react with 
clone E9 which contained an insert sequence set forth in SEQ ID NO: 34, encoding 
open reading frames having amino acid sequences set forth in SEQ ID NO: 46, with 
SEQ ID NO: 34 having a high degree of sequence homology with HSV2 RL2 (also 
referred to as ICPO), the sequence of which is set forth in SEQ ID NO:35, encoding an 
open reading frame having the amino acid sequences set forth in SEQ ID NO:47. 
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EXAMPLE 4 

Characterization of CD4 Clones Fl 1F5 and G10A9 
Examples 2 and 3 describe the generation of CD4 T cell lines from 
donors AD 104 and DK2313 which were screened against cDNA libraries generated 
using the HSV-2 333 strain. AD 104 was found to react against the clone 
HSV2II_UL46fragFllF5. This insert was partially sequenced with the sequence being 
disclosed in SEQ ED NO:13. Full length sequencing of the insert revealed that it 
encoded a fragment of UL46 which was derived from the HSV-2 333 strain. The DNA 
and amino acid sequences from this insert are disclosed in SEQ ID NO:52 and 54, 
respectively. 

DK2312 was found to react against the clone G10. This insert was 
partially sequenced and the sequence was disclosed in SEQ ID NO:48. Full length 
sequencing revealed that it encoded a fragment of UL37 which was derived from the 
HSV-2 333 strain. The DNA and amino acid sequences from this insert are disclosed in 
SEQ ID NO:53 and 55, respectively. 

EXAMPLE 5 

Identification Of CD8-Specific Immunoreactive Peptides Derived From HSV-2. 

Peripheral blood mononuclear cells were obtained from the normal 
donors AD104, AD1 16, AD120, and D477. These donors were HLA typed using low- 
resolution DNA-typing methodology and the results are presented in Table 2. 



Table 2 



DONOR 


AD104 


AD116 


AD120 


D477 


HLA-A 


24, 33 


0206, 24 


0211,3303 


0201,2501 


HLA-B 


45, 58 


0702, 35 


1505,4403 


1501,5101 


HLA-C 


01, 0302 


0702, 1203 


0303, 0706 


0304, 12 



In order to determine which epitopes of HSV-2 were immunoreactive, 
synthetic peptides were synthesized. These peptides were 15 amino acids in length 
overlapping by 1 1 amino acids. The peptides were synthesized across the following 
regions of the following HSV-2 genes: UL15 (aa 600-734), UL18 (aa 1-110), UL23 (aa 
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241-376), UL46 (aa 617-722), US3 (aal25-276), and US 8 A (aa 83-146). 

CD8 + T cells were purified from the PBMC of each of the donors 
described above using negative selection. The purified CD8+ T cells were then tested 
for their reactivity against the HSV-2 specific peptides. Co-cultures containing 2xl0 5 
CD8 + T cells, lxl 0 4 autologous dendritic cells and lOjag/ml of a peptide pool (on 
average containing 10 peptides/pool) were established in 96 well ELISPOT plates that 
had been pre-coated with anti-human IFN-y antibody (1D1K: mAbTech). After 24 
hours, the ELISPOT plates were developed using a standard protocol well known to one 
of skill in the art. The number of spots per well were then counted using an automated 
video microscopy ELISPOT plate reader. CD8+ T cells from donors demonstrating a 
positive response against a peptide pool were then subsequently tested against the 
individual peptides in that pool in a second ELISPOT assay. The results of peptide 
reactivity are presented in Table 3. 



Table 3. 



Donor 


HSV-2 Gene 


Peptide # (amino 


SEQIDNO 






acid numbering) 




AD104 


US3 


#33 (262-276) 


63 


AD116 


UL15 


#23(688-702) 


56 






#30(716-730) 


57 




UL23 


#7(265-279) 


58 




UL46 


#2(621-635) 


59 






#8(645-659) 


60 






#9(649-663) 


61 






#11(657-671) 


62 




US8A 


#5(99-113) 


64 


AD120 


UL46 


Peptides: #1-12 




D477 


UL18 


Peptides: #1-12 






UL23 . 


Peptides: #1-20 






UL46 


Peptides: #1-12 
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Those skilled in the art will appreciate that the conceptions and specific 
embodiments disclosed in the foregoing description may be readily utilized as a basis 
for modifying or designing other embodiments for carrying out the same purposes of the 
present invention. Those skilled in the art will also appreciate that such equivalent 
embodiments do not depart from the spirit and scope of the invention as set forth in the 
appended claims. 
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Claims 

What is Claimed: 

1. An isolated polypeptide comprising at least an immunogenic 
portion of an HSV antigen, wherein said antigen comprises an amino acid sequence set 
forth in any one of SEQ ID NO: 2-3, 5-7, 10-12, 14-15, 17-18, 20- 23, 25-26, 39-41, 45- 
47, 50-51, and 54-55. 

2. An isolated polynucleotide encoding a polypeptide of claim 1 . 

3. An isolated polynucleotide of claim 2, wherein said 
polynucleotide comprises a sequence set forth in any one of SEQ ID NO: 1, 4, 8-9, 13, 
16, 19,24, 34-38, 48-49, and 52-53. 

4. An isolated polypeptide comprising at least an immunogenic 
portion of a HSV UL46 antigen, wherein said antigen comprises an amino acid 
sequence set forth in SEQ ID NO: 15, and wherein said immunogenic portion is 
selected from the group consisting of SEQ ID NO: 27-33 and 59-62. 

5. An isolated polypeptide comprising at least an immunogenic 
portion of a HSV UL15 antigen, wherein said antigen comprises an amino acid 
sequence, set forth in SEQ ID NO: 26, and wherein said immunogenic portion is 
selected from the group consisting of SEQ ID NO: 56-57. 

6. An isolated polypeptide comprising at least an immunogenic 
portion of a HSV US3 antigen, wherein said antigen comprises an amino acid sequence 
set forth in SEQ ID NO: 12, and wherein said immunogenic portion comprises SEQ ID 
NO: 63. 

7. An isolated polypeptide comprising at least an immunogenic 
portion of a HSV US8A antigen, wherein said antigen comprises an amino acid 
sequence set forth in SEQ ID NO: 7, and wherein said immunogenic portion comprises 
SEQ ID NO: 64. 
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8. A fusion protein comprising a polypeptide according to claim 1 
and a fusion partner. 

9. A fusion protein according to claim 8, wherein the fusion partner 
comprises an expression enhancer that increases expression of the fusion protein in a 
host cell transfected with a polynucleotide encoding the fusion protein. 

10. A fusion protein according to claim 8, wherein the fusion partner 
comprises a T helper epitope that is not present within the polypeptide of claim 1 . 

11. A fusion protein according to claim 8, wherein the fusion partner 
comprises an affinity tag. 

.12. An isolated polynucleotide encoding a fusion protein according 

to claim 8. 

13. An isolated monoclonal or polyclonal antibody, or antigen- 
binding fragment thereof, that specifically binds to a polypeptide of claim 1 . 

14. A pharmaceutical composition comprising a polypeptide 
according to claim 1 or a polynucleotide encoding said polypeptide, and a 
physiologically acceptable carrier. 

15. A pharmaceutical composition comprising a polypeptide 
according to claim 1, or a polynucleotide encoding said polypeptide, and an 
immunostimulant. 

16. The pharmaceutical composition of claim 15, wherein the 
immunostimulant is selected from the group consisting of a monophosphoryl lipid A, 
aminoalkyl glucosaminide phosphate, saponin, or a combination thereof. 



17. A method for stimulating an immune response in a patient, 
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comprising administering to a patient a pharmaceutical composition according to any 
one of claims 14-16. 



1 9. The method according to claim 1 8, wherein the biological sample 
is selected from the group consisting of whole blood, serum, plasma, saliva, 
cerebrospinal fluid and urine. 

20. A method for detecting HSV infection in a biological sample, 

comprising: 

(a) contacting the biological sample with a binding agent which is 
capable of binding to a polypeptide according to claim 1 ; and 

(b) detecting in the sample a polypeptide that binds to the binding 
agent, thereby detecting HSV infection in the biological sample. 

21. The method of claim 20, wherein the binding agent is a 
monoclonal antibody. 

22. The method of claim 20, wherein the binding agent is a 
polyclonal antibody. 



(a) 
(b) 



A method for detecting HSV infection in a patient, comprising: 

obtaining a biological sample from the patient; 

contacting the sample with a polypeptide according to claim 1; 



and 



(c) 



detecting the presence of antibodies that bind to the polypeptide. 



23. The method of claim 20 wherein the biological sample is selected 
from the group consisting of whole blood, sputum, serum, plasma, saliva, cerebrospinal 
fluid and urine. 
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24. A diagnostic kit comprising a component selected from the group 

consisting of: 

(a) a polypeptide according to claim 1 ; 

(b) a fusion protein according to claim 8; 

(c) at least one antibody, or antigen-binding fragment thereof, 
according to claim 13; and 

(d) a detection reagent. 

25. The kit according to claim 24, wherein the polypeptide is 
immobilized on a solid support. 

26. The kit according to claim 24, wherein the detection reagent 
comprises a reporter group conjugated to a binding agent. 

27. The kit of claim 26, wherein the binding agent is selected from 
the group consisting of antiimmunoglobulins, Protein G, Protein A and lectins. 

28. The kit of claim 26, wherein the reporter group is selected from 
the group consisting of radioisotopes, fluorescent groups, luminescent groups, enzymes, 
biotin and dye particles. 

29. A method for treating HSV infection in a patient, comprising the 

steps of: 

(a) obtaining peripheral blood cells from the patient; 

(b) incubating the cells in the presence of at least one polypeptide 
according to claim 1, such that T cells proliferate; and 

(c) administering to the patient the proliferated T cells. 

30. The method of claim 29, wherein the step of incubating the T 
cells is repeated one or more times. 



31. The method of claim 29, wherein step (a) further comprises 
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separating T cells from the peripheral blood cells, and the cells incubated in step (b) are 
the T cells. 

32. The method of claim 29, wherein step (a) further comprises 
separating CD4+ cells or CD8+ T cells from the peripheral blood cells, and the cells 
proliferated in step (b) are CD4+ or CD8+ T cells. 

33. The method of claim 29, wherein step (a) further comprises 
separating gamma/delta T lymphocytes from the peripheral blood cells, and the cells 
proliferated in step (b) are gamma/delta T lymphocytes. 

34. The method of claim 29, wherein step (b) further comprises 
cloning one or more T cells that proliferated in the presence of the polypeptide. 

35. A pharmaceutical composition for the treatment of HSV infection 
in a patient, comprising T cells proliferated in the presence of a polypeptide of claim 1, 
in combination with a physiologically acceptable carrier. 

36. A method for treating HSV infection in a patient, comprising the 

steps of: 

(a) incubating antigen presenting cells in the presence of at least one 
polypeptide of claim 1; 

(b) administering to the patient the incubated antigen presenting 

cells. 

37. The method of claim 36, wherein the antigen presenting cells are 
selected from the group consisting of dendritic cells, macrophage cells, B cells 
fibroblast cells, monocyte cells, and stem cells. 

38. A pharmaceutical composition for the treatment of HSV infection 
in a patient, comprising antigen presenting cells incubated in the presence of a 
polypeptide of claim 1, in combination with a physiologically acceptable carrier. 
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SEQUENCE LISTING 

<110> Corixa Corporation 
Hosken, Nancy Ann 
Day, Craig 
Dillon Davin C. 
McGowan, Patrick 
Sleath, Paul R. 

<120> COMPOSITIONS AND METHODS FOR THE DIAGNOSIS AND 
TREATMENT OF HERPES SIMPLEX VIRUS INFECTION 

<130> 210121. 538PC 

<140> PCT 

<141> 2001-06-28 

<160> 64 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 815 
<212> DNA 

<213> Herpes simplex virus 
<400> 1 

ccacgccgcc gcaccccagg cggacgtggc gccggttctg gacagccagc ccactgtggg 60 

aacggacccc ggctacccag tccccctaga actcacgccc gagaacgcgg aggcggtggc 120 

gcggtttctg ggggacgccg tcgaccgcga gcccgcgctc atgctggagt acttctgtcg 180 

gtgcgcccgc gaggagagca agcgcgtgcc cccacgaacc ttcggcagcg ccccccgcct 240 

cacggaggac gactttgggc tcctgaacta cgcgctcgct gagatgcgac gcctgtgcct 300 

ggaccttccc ccggtccccc ccaacgcata cacgccctat catctgaggg agtatgcgac 360 

gcggctggtt aacgggttca aacccctggt gcggcggtcc gcccgcctgt atcgcatcct 420 

ggggattctg gttcacctgc gcatccgtac ccgggaggcc tcctttgagg aatggatgcg 480 

ctccaaggag gtggacctgg acttcgggct gacggaaagg cttcgcgaac acgaggccca 540 

gctaatgatc ctggcccagg ccctgaaccc ctacgactgt ctgatccaca gcaccccgaa 600 

cacgctcgtc gagcgggggc tgcagtcggc gctgaagtac gaagagtttt acctcaagcg 660 

cttcggcggg cactacatgg agtccgtctt ccagatgtac acccgcatcg ccgggttcct 720 

ggcgtgccgg gcgacccgcg gcatgcgcca catcgccctg gggcgacagg ggtcgtggtg 780 

ggaaatgttc aagttctttt tccaccgcct ctacg 815 

<210> 2 
<211> 271 
<212> PRT 

<213> Herpes simplex virus 
<400> 2 

His Ala Ala Ala Pro Gin Ala Asp Val Ala Pro Val Leu Asp Ser Gin 

15 10 15 

Pro Thr Val Gly Thr Asp Pro Gly Tyr Pro Val Pro Leu Glu Leu Thr 

20 25 30 

Pro Glu Asn Ala Glu Ala Val Ala Arg Phe Leu Gly Asp Ala Val Asp 

35 40 45 

Arg Glu Pro Ala Leu Met Leu Glu Tyr Phe Cys Arg Cys Ala Arg Glu 

50 55 60 

Glu Ser Lys Arg Val Pro Pro Arg Thr Phe Gly Ser Ala Pro Arg Leu 
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65 








70 


75 


80 


Thr 


Glu 


Asp 


Asp 




T.on Ti^n flen T\7r* Z\l a 
ijc U UCU noli iyx 


LiCU rxXel UJLU Lie L. ni y 










85 






™- y 


Leu 


Cys 


Leu 




riu nu vdi trxtJ nu 


Hen TV 1 a Tt/y* TViv Prn 
rloll rtld X yi 1I1X ir.HJ 








100 




1 OR 


xxvs 


Tvr 


His 


Leu 


Arg 


Glu Tyr 


Aid J.iix. ni y jjcu vdj. 


nan uiy irxic xiya rxu 






115 






120 


1 PR 

X£*J 


Leu 


Val 


Tim 


aj. y 


Gar Zil a 


mx y ijcu lyr iiry xxe 


Ton PI if Tin Ton \7s» 1 

xieu \aJ-Y -Lxe lieu vai 




130 










Xft v/ 


His 


Leu. 




Tl *a 


rax y J. III. 


ZVv/t Cln &1=> Car* DV> a 
rVxy blU nJ.a oei Iriie 


uiu uiu irp riex. jfiry 


145 








150 




i fin 


Ser 




G1 ii 


Vol 




A on Dha ml t f T on TV* v~ 

nsp true oxy Leu ±iir 


uiu firg jjeu Arg uxu 










165 


1 7(1 


1 7R 


His 


Glu 


Ala 


Gin 


T.pii Me>1- 
uc la ric L. 


■Lie UCU. rxJLcx ulll nld 


T.on 7\ <-• T-i Prn T \T~y Zi on 

iicU nsn rro j.yx nsp 








180 




185 


190 


Cys 


Leu 


He 


His 


Ser Thr 


Pro Asn Thr Leu Val 


Glu Arg Gly Leu Gin 






195 






200 


205 


Ser 


Ala 


Leu 


Lys 


Tyr Glu 


Glu Phe Tyr Leu Lys 


Arg Phe Gly Gly His 




210 








215 


220 


Tyr 


Met 


Glu 


Ser 


Val Phe 


Gin Met Tyr Thr Arg 


He Ala Gly Phe Leu 


225 








230 


235 


240 


Ala 


Cys 


Arg 


Ala 


Thr Arg 


Gly Met Arg His He 


Ala Leu Gly Arg Gin 










245 


250 


255 


Gly 


Ser 


Trp 


Trp 


Glu Met 


Phe Lys Phe Phe Phe 


His Arg Leu Tyr 








260 




265 


270 



<210> 3 
<211> 1142 
<212> PRT 

<213> Herpes simplex virus 



<400> 3 



Met 


Ala. 


Asn 


Arg 


Pro 


Ala 


Ala Ser Ala Leu Ala 


Gly Ala Arg Ser Pro 


1 








5 




10 


15 


Ser 


Glu 


Arg 


Gin 


Glu 


Pro 


Arg Glu Pro Glu Val 


Ala Pro Pro Gly Gly 








20 






25 


30 


Asp 


His 


Val 


Phe 


Cys 


Arg 


Lys Val Ser Gly Val 


Met Val Leu Ser Ser 






35 








40 


45 


Asp 


Pro 


Pro 


Gly 


Pro 


Ala 


Ala Tyr Arg He Ser 


Asp Ser Ser Phe Val 




50 










55 


60 


Gin 


Cys 


Gly 


Ser 


Asn 


Cys 


Ser Met He He Asp 


Gly Asp Val Ala Arg 


65 










70 


75 


80 


Gly 


His 


Leu 


Arg 


Asp 


Leu 


Glu Gly Ala Thr Ser 


Thr Gly Ala Phe Val 










85 




90 


95 


Ala 


lie 


Ser 


Asn 


Val 


Ala 


Ala Gly Gly Asp Gly 


Arg Thr Ala Val Val 








100 






105 


110 


Ala 


Leu 


Gly 


Gly 


Thr 


Ser 


Gly Pro Ser Ala Thr 


Thr Ser Val Gly Thr 






115 








120 


125 


Gin 


Thr 


Ser 


Gly 


Glu 


Phe 


Leu His Gly Asn Pro 


Arg Thr Pro Glu Pro 




130 










135 


140 


Gin 


Gly 


Pro 


Gin 


Ala 


Val 


Pro Pro Pro Pro Pro 


Pro Pro Phe Pro Trp 


145 










150 


155 


160 


Gly 


His 


Glu 


Cys 


Cys 


Ala 


Arg Arg Asp Ala Arg 


Gly Gly Ala Glu Lys 










165 




170 


175 


Asp 


Val 


Gly 


Ala 


Ala 


Glu 


Ser Trp Ser Asp Gly 


Pro Ser Ser Asp Ser 








180 






185 


190 


Glu 


Thr 


Glu 


Asp 


Ser 


Asp 


Ser Ser Asp Glu Asp 


Thr Gly Ser Glu Thr 






195 








200 


205 
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Leu 


Ser 


Arg 


Ser 


Ser 


£>er lie lrp Ala Ala Gly 


Ala Thr Asp Asp Asp 














0 on 
zzu 


Asp 


Ser 


Asp 


Ser 


Asp 


Ser Arg Ser Asp Asp Ser 


val bin Pro Asp vai 












230 235 


240 


vai 


va± 


Arg 


Arg 


Arg 


Trp Ser Asp Gly Pro Ala 


Pro vai Ala pne rro 










0 a c; 






Lys 


Pro 


Arg 


Arg 


Pro 


Gly Asp Ser Pro Gly Asn 


Pro Gly Leu Gly Ala 








2oU 




2o5 


0 1 n 


bly 


rpi_ „ 

inr 


bly 


Pro 


biy 


Ser Ala Thr Asp Pro Arg 


Ala Ser Ala Asp Ser 






2/5 






2oU 


2o5 


TV 

Asp 


Ser 


TV "V 

Ala 


TV 1 _ 

Ala 


His 


Ala Ala Ala Pro Gin Ala 


T\ — TT— T TV T a T~V_~— » TT— T 

Asp Val Ala Pro Val 




o on 
290 








one 
295 


3U0 


lieu 


Asp 


Ser 


Gin 


Pro 


Thr Val Gly Thr Asp Pro 


Gly Tyr Pro Val Pro 


one 










O "1 f\ 0 1 c 

31U 315 


0 0 rv 


Leu 


GlU 


Leu 


Thr 


Pro 


blu Asn Ala Glu Ala Val 


Ala Arg Phe Leu Gly 












■"ion 
33U 


oJb 


Asp 


TV 1 

Ala 


val 


Asp 


TV -w- 

Arg 


Glu Pro Ala Leu Met Leu 


Glu Tyr Phe Cys Arg 








34 U 




345 


OCA 

35U 


Cys 


Ala 


Arg 


blU 


blu 


Ser Lys Arg Val Pro Pro 


Arg Thr Phe Gly Ser 






"5 n; c 








3b5 


TV 1 -» 

Ala 


Fro 


Arg 


Leu 


inr 


Glu Asp Asp Phe Gly Leu 


Leu Asn Tyr Ala Leu 




0 o a 








3 / 5 


0 0 r\ 
3o0 


Ala 


blU 


Met 


Arg 


Arg 


Leu Cys Leu Asp Leu Pro 


Pro Val Pro Pro Asn 


Job 










390 395 


a rvrv 
400 


Ala 


Tyr 


inr 


Pro 


Tyr 


His Leu Arg Glu Tyr Ala 


Thr Arg Leu Val Asn 










4U5 


a 1 rv 
4 1U 


ill c 

415 


Gly 


Pne 


Lys 


Pro 


T ^» , - 

Leu 


Val Arg Arg Ser Ala Arg 


Leu Tyr Arg He Leu 








a 0 r\ 
4 20 




A O C 

425 


430 


Gly 


Val 


Leu 


Val 


His 


Leu Arg He Arg Thr Arg 


Glu Ala Ser Phe Glu 






A O C 

435 






A A f\ 

440 


A A C 

445 


Glu 


Trp 


Met 


Arg 


Ser 


Lys Glu Val Asp Leu Asp 


Phe Gly Leu Thr Glu 




45U 








A CC 

455 


4d0 


Arg 


T ~ ... 

Leu 


TV v. 

Arg 


Glu 


His 


Glu Ala Gin Leu Met He 


Leu Ala Gin Ala Leu 


A £. C 

465 










x *t rv j n r 

470 475 


480 


Asn 


Pro 


Tyr 


Asp 


Cys 


Leu He His Ser Thr Pro 


Asn Thr Leu Val Glu 










/IOC 

485 


a cvrv 
490 


A fiC 

495 


Arg 


Gly 


Leu 


Gin 


Ser 


Ala Leu Lys Tyr Glu Glu 


Phe Tyr Leu Lys Arg 








500 




505 


510 


rne 


bly 


Gly 


HIS 


Tyr 


Met Glu Ser Val Pne Gin 


Met Tyr Thr Arg He 






OlO 








dZd 


Ala 


bly 


Fne 


Leu 


TV 1 

Ala 


Cys Arg Ala Thr Arg Gly 


Met Arg His He Ala 












535 


C A f\ 

54 U 


Leu 


bly 


TV 

Arg 


bin 


bly 


Ser Trp Trp Glu Met Phe 


Lys Pne Pne Pne His 


C. / K 
545 










OOU 555 


56U 


Arg 


Leu 


Tyr 


Asp 


His 


Gin He Val Pro Ser Tnr 


Pro Ala Met Leu Asn 










boo 


5 /U 


c o c 

5 /5 


Leu 


Gly 


Thr 


Arg 


TV _ — 

Asn 


Tyr Tyr Thr Ser Ser Cys 


Tyr Leu Val Asn Pro 








580 




585 


590 


Gin 


Ala 


Thr 


Thr 


Asn 


Gin Ala Thr Leu Arg Ala 


He Thr Gly Asn Val 






595 






600 


605 


Ser 


Ala 


He 


Leu 


Ala 


Arg Asn Gly Gly He Gly 


Leu Cys Met Gin Ala 




610 








615 


620 


Phe 


Asn 


Asp 


Ala 


Ser 


Pro Gly Thr Ala Ser He 


Met Pro Ala Leu Lys 


625 










630 635 


640 


Val 


Leu 


Asp 


Ser 


Leu 


Val Ala Ala His Asn Lys 


Gin Ser Thr Arg Pro 










645 


650 


655 


Thr 


Gly 


Ala 


Cys 


Val 


Tyr Leu Glu Pro Trp His 


Ser Asp Val Arg Ala 








660 




665 


670 
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Val Leu Arg Met Lys Gly Val Leu Ala Gly Glu Glu Ala Gin Arg Cys 

675 680 685 

Asp Asn lie Phe Ser Ala Leu Trp Met Pro Asp Leu Phe Phe Lys Arg 

690 695 700 

Leu lie Arg His Leu Asp Gly Glu Lys Asn Val Thr Trp Ser Leu Phe 
705 710 715 720 

Asp Arg Asp Thr Ser Met Ser Leu Ala Asp Phe His Gly Glu Glu Phe 

725 730 735 

Glu Lys Leu Tyr Glu His Leu Glu Ala Met Gly Phe Gly Glu Thr lie 

740 745 ~ 750 

Pro lie Gin Asp Leu Ala Tyr Ala He Val Arg Ser Ala Ala Thr Thr 

755 760 765 

Gly Ser Pro Phe He Met Phe Lys Asp Ala Val Asn Arg His Tyr He 

770 775 • 780 

Tyr Asp Thr Gin Gly Ala Ala He Ala Gly Ser Asn Leu Cys Thr Glu 
785 790 795 800 

He Val His Pro Ala Ser Lys Arg Ser Ser Gly Val Cys Asn Leu Gly 

805 810 815 

Ser Val Asn -Leu Ala Arg Cys Val Ser Arg Gin Thr Phe Asp Phe Gly 

820 825 830 

Arg Leu Arg Asp Ala Val .Gin Ala Cys Val Leu Met Val Asn He Met 

835 840 845 

He Asp Ser Thr Leu Gin Pro Thr Pro Gin Cys Thr Arg Gly Asn Asp 

850 855 860 

Asn Leu Arg Ser Met Gly He Gly Met Gin Gly Leu His Thr Ala Cys 
865 870 875 880 

Leu Lys Met Gly Leu Asp Leu Glu Ser Ala Glu Phe Arg Asp Leu Asn 

885 890 895 

Thr His He Ala Glu Val Met Leu Leu Ala Ala Met Lys Thr Ser Asn 

900 905 910 

Ala Leu Cys Val Arg Gly Ala Arg Pro Phe Ser His Phe Lys Arg Ser 

915 920 925 

Met Tyr Arg Ala Gly Arg Phe His Trp Glu Arg Phe Ser Asn Ala Ser 

930 935 940 

Pro Arg Tyr Glu Gly Glu Trp Glu Met Leu Arg Gin Ser Met Met Lys 
945 950 955 960 

His Gly Leu Arg Asn Ser Gin Phe He Ala Leu Met Pro Thr Ala Ala 

965 970 975 

Ser Ala Gin He Ser Asp Val Ser Glu Gly Phe Ala Pro Leu Phe Thr 

980 985 " 990 

Asn Leu Phe Ser Lys Val Thr Arg Asp Gly Glu Thr Leu Arg Pro Asn 

995 1000 1005 

Thr Leu Leu Leu Lys Glu Leu Glu Arg Thr Phe Gly Gly Lys Arg Leu 

1010 1015 " 1020 

Leu Asp Ala Met Asp Gly Leu Glu Ala Lys Gin Trp Ser Val Ala Gin 
1025 1030 1035 1040 

Ala Leu Pro Cys Leu Asp Pro Ala His Pro Leu Arg Arg Phe Lys Thr 

1045 1050 1055 

Ala Phe Asp Tyr Asp Gin Glu Leu Leu He Asp Leu Cys Ala Asp Arg 

1060 1065 1070 

Ala Pro Tyr Val Asp His Ser Gin Ser Met Thr Leu Tyr Val Thr Glu 

1075 1080 1085 

Lys Ala Asp Gly Thr Leu Pro Ala Ser Thr Leu Val Arg Leu Leu Val 

1090 1095 1100 

His Ala Tyr Lys Arg Gly Leu Lys Thr Gly Met Tyr Tyr Cys Lys Val 
1105 1110 H15 1120 

Arg Lys Ala Thr Asn Ser Gly Val Phe Ala Gly Asp Asp Asn He Val 
1125 H30 1135 
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Cys Thr Ser Cys Ala Leu 
1140 

<210> 4 
<211> 208 
<212> DNA 

<213> Herpes simplex virus 
<400> 4 

gcgccgcgcc cgcgtgccgc agaccacctc gcggcggctc ccccgcggcc tttcccgtgg 60 
ccctccacgc cgtggacgcc ccctcccaat tcgtcacctg gctcgccgtg cgctggctgc 120 
ggggggcggt gggtctcggg gccgtcctgt gcgggattgc gttttacgtg acgtcaatcg 180 
cccgaggcgc ataaaggtcc ggcggcca 208 

<210> 5 
<211> 64 
<212> PRT 

<213> 'Herpes simplex virus 
<400> 5 

Gly Ala Ala Pro Ala Cys Arg Arg Pro Pro Arg Gly Gly Ser Pro Ala 

1 5 10 15 

Ala Phe Pro Val Ala Leu His Ala Val Asp Ala Pro Ser Gin Phe Val 

20 25 30 

Thr Trp Leu Ala Val Arg Trp Leu Arg Gly Ala Val Gly Leu Gly Ala 

35 40 . 45 

Val Leu Cys Gly He Ala Phe Tyr Val Thr Ser He Ala Arg Gly Ala 
50 55 60 

<210> 6 
<211> 70 
<212> PRT 

<213> Herpes simplex virus 
<400> 6 

Arg Arg Ala Arg Val Pro Gin Thr Thr Ser Arg Arg Leu Pro Arg Gly 

15 10 15 

Leu Ser Arg Gly Pro Pro Arg Arg Gly Arg Pro Leu Pro He Arg His 

20 25 30 

Leu Ala Arg Arg Ala Leu Ala Ala Gly Gly Gly Gly Ser Arg Gly Arg 

35 40 45 

Pro Val Arg Asp Cys Val Leu Arg Asp Val Asn Arg Pro Arg Arg He 

50 55 60 

Lys Val Arg Arg Pro Ala 
65 70 

<210> 7 
<211> 146 
<212> PRT 

<213> Herpes simplex virus 
<400> 7 

Met Asp Pro Ala Leu Arg Ser Tyr His Gin Arg Leu Arg Leu Tyr Thr 

1 5 10 15 

Pro He Ala Arg Gly Val Asn Leu Ala Ala Arg Ser Pro Pro Leu Val 

20 25 30 

Arg Glu Ala Arg Ala Val Val Thr Pro Arg Pro Pro He Arg Pro Ser 
35 40 45 
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Ser 


Gly 


Lys Ala Ser 


Ser Asp 


Asp 


Ala 


Asp Val Gly Asp Glu Leu 


lie 




50 




55 






60 




Ala 


lie 


Ala Asp Ala 


Arg Gly 


Asp 


Pro 


Pro Glu Thr Leu Pro Pro 


Gly 


65 






70 






75 


80 


Ala 


Gly 


Gly Ala Ala 


Pro Ala 


Cys 


Arg 


Arg Pro Pro Arg Gly Gly 


Ser 






85 








90 95 




Pro 


Ala 


Ala Phe Pro 


Val Ala 


Leu 


His 


Ala Val Asp Ala Pro Ser 


Gin 






100 






105 


no 




Phe 


Val 


Thr Trp Leu 


Ala Val 


Arg 


Trp 


Leu Arg Gly Ala Val Gly 


Leu 






115 




120 




125 




Gly 


Ala 


Val Leu Cys 


Gly lie 


Ala 


Phe 


Tyr Val Thr Ser lie Ala 


Arg 




130 




135 






140 



Gly Ala 
145 



<210> 8 
<211> 137 
<212> DNA 

<213> Herpes simplx virus 
<400> 8 

ccccaccgcc cccccacagg cggcgcgtgc ggagggcggc ccgtgcgtcc ccccggtccc 60 
cgcgggccgc ccgtggcgct cggtgccccc ggtatggtat tccgccccca accccgggtt 120 
tcgtggcctg cgtttcc 137 

<210> 9 
<211> 430 
<212> DNA 

<213> Herpes simplex virus 
<400> 9 

atggaccggg aggcacttcg ggccatcagc cgcgggtgca agcccccttc gaccctggca 60 
aaactggtga ccgggctggg attcgcgatc cacggagcgc tcatcccggg gtcggagggg 120 
tgtgtctttg atagcagcca cccgaactac cctcatcggg taatcgtcaa ggcggggtgg 180 
tacgccagca cgaaccacga ggcgcggctg ctgagacgcc tgaaccaccc cgcgatccta 240 
cccctcctgg acctgcacgt cgtttctggg gtcacgtgtc tggtcctccc caagtatcac 300 
tgcgacctgt atacctatct gagcaagcgc ccgtctccgt tgggccacct acagataacc 360 
gcggtctccc ggcagctctt gagcgccatc gactacgtcc actgcgaagg catcatccac 420 
cgcgatatta 430 

<210> 10 
<211> 22 
<212> PRT 

<213> Herpes simplex virus 
<400> 10 

Trp Thr Gly Arg His Phe Gly Pro Ser Ala Ala Gly Ala Ser Pro Leu 

1 5 10 15 

Arg Pro Trp Gin Asn Trp 
20 

<210> 11 
<211> 143 
<212> PRT 

<213> Herpes simplex virus 
<400> 11 

Met Asp Arg Glu Ala Leu Arg Ala lie Ser Arg Gly Cys Lys Pro Pro 
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1 




5 


10 




15 




Oar 
OCJ. 


Thr 


T.^n Al a T»\7Q T.f^n \7al 
lieu nia xj_yo jjcu v ai 


Thr* m v TiPii Glv PVia AT a 


He 


His 


Gly 








25 


30 






Al A 




IXC IT 1. \J VJ±jr OCi, ulU 


Glv fwq Val Php A<?r> fipr 


Ser 


His 


Pro 






35 


40 45 








Asn 


ryr 


irxO nlS fixy Va.x lie 


veil liyo riXa \jJ.y lip 1 yx 


Ala 


Ser 


xxix 




50 


55 


60 








Asn 


His 


Glu Ala Arg Leu Leu 


Arg Arg Leu Asn His Pro 


Ala 


He 


Leu 


65 




70 


75 






80 


Pro 


Leu 


Leu Asp Leu His Val 


Val Ser Gly Val Thr Cys 


Leu 


Val 


Leu 






85 


90 




95 




Pro 


Lys 


Tyr His Cys Asp Leu 


Tyr Thr Tyr Leu Ser Lys 


Arg Pro 


Ser 






100 


105 


110 






Pro 


Leu 


Gly His Leu Gin He 


Thr Ala Val Ser Arg Gin 


Leu 


Leu 


Ser 






115 


120 125 








Ala 


He 


Asp Tyr Val His Cys 


Glu Gly He He His Arg 


Asp He 






130 


135 


140 









<210> 12 
<211> 481 
<212> PRT 

<213> Herpes simplex virus 
<400> 12 



Met 


Ala 


Cys 


Arg 


Lys 


Phe 


Cys 


Gly Val Tyr Arg Arg Pro 


Asp Lys 


Arg 


1 








5 






10 


15 




Gin 


Glu 


Ala 


Ser 


Val 


Pro 


Pro 


Glu Thr Asn Thr Ala Pro 


Ala Phe 


Pro 








20 








25 


30 




Ala 


Ser 


Thr 


Phe 


Tyr 


Thr 


Pro 


Ala Glu Asp Ala Tyr Leu 


Ala Pro 


Gly 






35 










40 ' 45 






Pro 


Pro 


Glu 


Thr 


He 


His 


Pro 


Ser Arg Pro Pro Ser Pro 


Gly Glu 


Ala 




50 










55 


60 






Ala 


Arg 


Leu 


Cys 


Gin 


Leu 


Gin 


Glu He Leu Ala Gin Met 


His Ser 


Asp 


65 










70 




75 




80 


Glu 


Asp 


Tyr 


Pro 


He 


Val 


Asp 


Ala Ala Gly Ala Glu Glu 


Glu Asp 


Glu 










85 






90 


95 




Ala 


Asp 


Asp 


Asp 


Ala 


Pro 


Asp 


Asp Val Ala Tyr Pro Glu 


Asp Tyr 


Ala 








100 








105 


110 




Glu 


Gly 


Arg 


Phe 


Leu 


Ser 


Met 


Val Ser Ala Ala Pro Leu 


Pro Gly 


Ala 






115 










120 125 






Ser 


Gly 


His 


Pro 


Pro 


Val 


Pro 


Gly Arg Ala Ala Pro Pro 


Asp Val 


Arg 




130 










135 


140 






Thr 


Cys 


Asp 


Thr 


Gly 


Lys 


Val 


Gly Ala Thr Gly Phe Thr 


Pro Glu 


Glu 


145 










150 




155 




160 


Leu 


Asp 


Thr 


Met 


Asp 


Arg 


Glu 


Ala Leu Arg Ala He Ser 


Arg Gly 


Cys 










165 






170 


175 




Lys 


Pro 


Pro 


Ser 


Thr 


Leu 


Ala 


Lys Leu Val Thr Gly Leu 


Gly Phe 


Ala 








180 








185 


190. 




He 


His 


Gly 


Ala 


Leu 


He 


Pro 


Gly Ser Glu Gly Cys Val 


Phe Asp 


Ser 






195 










200 205 






Ser 


His 


Pro 


Asn 


Tyr 


Pro 


His 


Arg Val He Val Lys Ala 


Gly Trp 


Tyr 




210 










215 


220 






Ala 


Ser 


Thr 


Ser 


His 


Glu 


Ala 


Arg Leu Leu Arg Arg Leu 


Asn His 


Pro 


225 










230 




235 




240 


Ala 


He 


Leu 


Pro 


Leu 


Leu 


Asp 


Leu His Val Val Ser Gly 


Val Thr 


Cys 










245 






250 


255 




Leu 


Val 


Leu 


Pro 


Lys 


Tyr 


His 


Cys Asp Leu Tyr Thr Tyr 


Leu Ser 


Lys 








260 








265 


270 
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Arg 


Pro 


Ser Pro Leu Gl \/ His 


Leu Gin Tie Thr Ala 

xicu \jxii lie x i ix nx a. 


Val r* Arn ftll r» 
veil QCi niy oi 11 






275 


280 


J 


Leu 


Leu 


Ser Ala Tie A on Tvr 


vox nxo \-*_y o xj_y o oxy 


Tl o Tl p H-5 Q Arrr 

lie lie n is Aig 




290 


295 


300 




Asp 


lie 


xj_y o x in uxu noil ixt. 


PHo Tl o Ben Thr Pro 
Irlie lie noil 1X1X rrXU 


biu /isn lie ^ys 


305 




JXU 






Leu 


Glv 


7\ or*i Php d~] \r HI a 111 a 


P\r<! Php Val 7\ >~rT f21 \r 

v^yo cue vai /ox y vrriy 


Pifrt S*>rt Q^*" Cav 

^ys iiiy oer oer 






325 






Pro 


Phe 


flxo x _y x. vji _y xxc rvl d 


fil \r Thr Tl e* 7\ er> Thr 
wXy llll lie rioL) lill 


nsn iiici trro ulU 






340 


345 


J JU 


Val 


Leu 


nxei O-Ljf /Tioty civ-* iyx 


Thr fZln Val Tie &en 
nil olil Val lie rlop 


lie iip oer fixa 






355 






Gly 


Leu 


Val Tie Phe Cln Thr 
v ax lie xTiit; uxu x i ix. 


Ala Val Hi q Thr* Ala 
nlo Veil ni o iiii riid 


oer lieu Jrne oer 




370 


375 


380 




Ala 


Pro 


Arrr A<?r*t Prrt (?1 n ZX-rrr 
xixy nop nu vain ftx y 


nxv) nu v-rVo ritojJ rloll 


ml t\ Tl a Til a 7i-f rr 
Ulll lie xxlct 4*11 y 


385 




390 


395 


400 


lie 


lie 


Arg Gin Ala Gin Val 


His Val Asp Glu Phe 


Pro Thr His Ala 






405 


410 


415 


Glu 


Ser 


Arg Leu Thr Ala His 


Tyr Arg Ser Arg Ala 


Ala Gly Asn Asn 






420 


425 


430 


Arg 


Pro 


Ala Trp Thr Arg Pro 


Ala Trp Thr Arg Tyr 


Tyr Lys lie His 






435 


440 


445 


Thr 


Asp 


Val Glu Tyr Leu lie 


Cys Lys Ala Leu Thr 


Phe Asp Ala Ala 




450 


455 


460 




Leu 


Ar.g 


Pro Ser Ala Ala Glu 


Leu Leu Arg Leu Pro 


Leu Phe His Pro 


4 65 




470 


475 


480 


Lys 











<210> 13 
<211> 501 
<212> DNA 

<213> Herpes simplex virus 
<400> 13 

gggggcgcgt ctacgaggag atcccctggg ttegggtata cgaaaacatc tgccttcgcc 60 
ggcaagaege eggeggggeg gccccgccgg gagacgcccc ggactccccg tacatcgagg 120 
eggaaaatec cctgtacgac tggggcgggt ctgccctctt ctcccctccg ggggccacac 180 
gcgccccgga cccgggacta agcctgtcgc ccatgcccgc ccgcccccgg accaacgcgc 240 
tggecaaega cggcccgaca aacgtcgccg ccctcagcgc cctgttgacg aagctcaaac 300 
gcggccgaca ccagagccat taaaaaaatg cgaccgccgg ccccaccgtc tcggtttccg 360 
gcccctttcc ccgtatgtct gttttcaata aaaagtaaca aacagagaaa aaaaaacagc 420 
gagttccgea tggtttgtcg tacgeaatta gctgtttatt gttttttttt tggggggggg 480 
aagagaaaaa gaaaaaagga g 501 

<210> 14 
<211> 106 
<212> PRT 

<213> Herpes simplex virus 
<400> 14 

Gly Arg Val Tyr Glu Glu lie Pro Trp Val Arg Val Tyr Glu Asn lie 

15 10 15 

Cys Leu Arg Arg Gin Asp Ala Gly Gly Ala Ala Pro Pro Gly Asp Ala 

20 25 . 30 

Pro Asp Ser Pro Tyr lie Glu Ala Glu Asn Pro Leu Tyr Asp Trp Gly 

35 40 45 

Gly Ser Ala Leu Phe Ser Pro Pro Gly Ala Thr Arg Ala Pro Asp Pro 
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50 

Gly Leu Ser Leu 
65 

Ala Asn Asp Gly 

Lys Leu Lys Arg 
100 

<210> 15 
<211> 722 
<212> PRT 
<213> Herpes simplex virus 



<400> 15 



Met 


Gin 


Arg 


Arg 


Ala 


Arg 


Gly 


Ala Ser Ser Leu 


Arg Leu Ala Arg 


Cys 


i 








r- 

b 






1 A 
10 


1 c 

15 




Leu 


Thr 


Pro 


Ala 


Asn 


Leu 


lie 


Arg Gly Ala Asn 


Ala Gly Val Pro 


Glu 








o a 








o a 
25 


OA 




Arg 


Arg 


Ti- 
lls 


Phe 


TV 1 *. 

Ala 


Gly 


Cys 


Leu Leu Pro Thr 


Pro Glu Gly Leu 


Leu 






JO 










a n 
4U 


a t; 




Ser 


Ala 


Ala 


Val 


Gly 


Val 


Leu 


Arg Gin Arg Ala 


Asp Asp Leu Gin 


Pro 




c a 










55 




60 




Ala 


Phe 


Leu 


Thr 


Gly 


7V "1 _ 

Ala 


Asp 


Arg Ser Val Arg 


Leu Ala Ala Arg 


HIS 


bo 










/U 








QA 


IT > _ 

His 


Asn 


Thr 


Val 


Pro 


Glu 


Ser 


Leu He Val Asp 


Gly Leu Ala Ser 


Asp 






















Pro 


His 


Tyr 


Asp 


Tyr 


lie 


Arg 


His Tyr Ala Ser 


Ala Ala Lys Gin 


Ala 








1 AA 

1UU 








1 A c 


11U 




Leu 


Gly 


Glu 


Val 


Glu 


Leu 


Ser 


Gly Gly Gin Leu 


Ser Arg Ala He 


Leu 






115 










120 


125 




Ala 


Gin 


Tyr 


Trp 


Lys 


Tyr 


Leu 


Gin Thr Val Val 


Pro Ser Gly Leu 


Asp 




130 










135 




140 




lie 


Pro 


Asp 


Asp 


Pro 


Ala 


Gly 


Asp Cys Asp Pro 


Ser Leu His Val 


Leu 


145 










150 




155 




160 


Leu 


Arg 


Pro 


Thr 


Leu 


Leu 


Pro 


Lys Leu Leu Val 


Arg Ala Pro Phe 


Lys 










165 






170 


175 




Ser 


Gly 


Ala 


Ala 


Ala 


Ala 


Lys 


Tyr Ala Ala Ala 


Val Ala Gly Leu 


Arg 








180 








185 


190 




Asp 


Ala 


Ala 


His 


Arg 


Leu 


Gin 


Gin Tyr Met Phe 


Phe Met Arg Pro 


Ala 






195 










200 ■ 


205 




Asp 


Pro 


Ser 


Arg 


Pro 


Ser 


Thr 


Asp Thr Ala Leu 


Arg Leu Ser Glu 


Leu 




210 










215 




220 




Leu 


Ala 


Tyr 


Val 


Ser 


Val 


Leu 


Tyr His Trp Ala 


Ser Trp Met Leu 


Trp 


225 










230 




235 




240 


Thr 


Ala 


Asp 


Lys 


Tyr 


Val 


Cys 


Arg Arg Leu Gly 


Pro Ala Asp Arg 


Arg 










245 






250 


255 




Phe 


Val 


Ala 


Leu 


Ser 


Gly 


Ser 


Leu Glu Ala Pro 


Ala Glu Thr Phe 


Ala 








260 








265 


270 




Arg 


His 


Leu 


Asp 


Arg 


Gly 


Pro 


Ser Gly Thr Thr 


Gly Ser Met Gin 


Cys 






275 










280 


285 




Met 


Ala 


Leu 


Arg 


Ala 


Ala 


Val 


Ser Asp Val Leu 


Gly His Leu Thr 


Arg 




290 










295 




300 




Leu 


Ala 


His 


Leu 


Trp 


Glu 


Thr 


Gly Lys Arg Ser 


Gly Gly Thr Tyr 


Gly 


305 










310 




315 




320 


lie 


Val 


Asp 


Ala 


lie 


Val 


Ser 


Thr Val Glu Val 


Leu Ser He Val 


His 










325 






330 


335 




His 


His 


Ala 


Gin 


Tyr 


lie 


He 


Asn Ala Thr Leu 


Thr Gly Tyr Val 


Val 








340 








345 


350 





55 

Ser Pro Met Pro Ala Arg 
70 

Pro Thr Asn Val Ala Ala 
85 90 
Gly Arg His Gin Ser His 
105 



60 

Pro Arg Thr Asn Ala Leu 
75 80 
Leu Ser Ala Leu Leu Thr 
95 
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Trp 


Ala 


Ser Asp Ser Leu Asn 


Asn Glu Tyr Leu Thr 


Ala Ala Val Asp 






355 


360 


365 


Ser 


Gin 


Glu Arg Phe Cys Arg 


Thr Ala Ala Pro Leu 


Phe Pro Thr Met 




370 


.375 


380 




Thr 


Ala 


Pro Ser Trp Ala Arg 


Met Glu Leu Ser lie 


Lys Ser Trp Phe 


385 




390 


395 


400 


Gly 


Ala 


Ala Leu Ala Pro Asp 


Leu Leu Arg Ser Gly 


Thr Pro Ser Pro 






405 


410 


415 


His 


Tyr 


Glu Ser lie Leu Arg 


Leu Ala Ala Ser Gly 


Pro Pro Gly Gly 






420 


425 


430 


Arg 


Gly 


Ala Val Gly Gly Ser 


Cys Arg Asp Lys lie 


Gin Arg Thr Arg 






435 


440 


445 


Arg 


Asp 


Asn Ala Pro Pro Pro 


Leu Pro Arg Ala Arg 


Pro His Ser Thr 




450 


. 455 


460 




Pro 


Ala 


Ala Pro Arg Arg Cys 


Arg Arg His Arg Glu 


Asp Leu Pro Glu 


465 




470 


475 


480 


Pro 


Pro 


His Val Asp Ala Ala 


Asp Arg Gly Pro Glu 


Pro Cys Ala Gly 






485 


490 


495 


.Arg 


Pro 


Ala Thr Tyr Tyr Thr 


His Met Ala Gly Ala 


Pro Pro Arg Leu 






500 


505 


510 


Pro 


Pro 


Arg Asn Pro Ala Pro 


Pro Glu Gin Arg Pro 


Ala Ala Ala Ala 






515 


520 


525 


Arg 


Pro 


Leu Ala Ala Gin Arg 


Glu Ala Ala Gly Val 


Tyr Asp Ala Val 




530 


535 


540 




Arg 


Thr 


Trp Gly Pro Asp Ala 


Glu Ala Glu Pro Asp 


Gin Met Glu Asn 


545 




550 


555 


560 


Thr 


Tyr 


Leu Leu Pro Asp Asp 


Asp Ala Ala Met Pro 


Ala Gly Val Gly 






565 


570 


575 


Leu 


Gly 


Ala Thr Pro Ala Ala 


Asp Thr Thr Ala Ala 


Ala Ala Trp Pro 






♦ 580 


585 


590 


Ala 


Glu 


Ser His Ala Pro Arg 


Ala Pro Ser Glu Asp 


Ala Asp Ser He 






595 


600 


605 


Tyr 


Glu 


Ser Val Gly Glu Asp 


Gly Gly Arg Val Tyr 


Glu Glu He Pro 




610 


615 


620 




Trp 


Val 


Arg Val Tyr Glu Asn 


lie Cys Pro Arg Arg 


Arcr Leu Ala Glv 


625 




630 


635 


640 


Gly 


Ala 


Ala Leu Pro Gly Asp 


Ala Pro Asp Ser Pro 


Tyr He Glu Ala 






645 


650 


655 


Glu 


Asn 


Pro Leu Tyr Asp Trp 


Gly Gly Ser Ala Leu 


Phe Ser Pro Arg 






660 


665 


670 


Arg 


Ala 


Thr Arg Ala Pro Asp 


Pro Gly Leu Ser Leu 


Ser Pro. Met Pro 






675 


680 


685 


Ala 


Arg 


Pro Arg Thr Asn Ala 


Leu Ala Asn Asp Gly 


Pro Thr Asn Val 




690 


695 


700 




Ala 


Ala 


Leu Ser Ala Leu Leu 


Thr Lys Leu Lys Arg 


Gly Arg His Gin 


705 




710 


715 


720 


Ser 


His 









<210> 16 
<211> 200 
<212> DNA 

<213> Herpes simplex virus 
<400> 16 

actgcaacgc aatcccatga aggccctgta tccgctcacc accaaggaac tcaagacttc 60 
cgaccccggg ggcgtgggcg gggaggggga ggaaggcgcg gaggggggcg ggtttgacga 120 
ggccaagttg gccgaggccc gagaaatgat ccgatatatg gctttggtgt cggccatgga 180 
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gcgcacggaa cacaaggcca 200 

<210> 17 
<211> 66 
<212> PRT 

<213> Herpes simplex virus 
<400> 17 

Leu Gin Arg Asn Pro Met Lys Ala Leu Tyr Pro Leu Thr Thr Lys Glu 

15 10 15 

Leu Lys Thr Ser Asp Pro Gly Gly Val Gly Gly Glu Gly Glu Glu Gly 

20 25 ~ 30 

Ala Glu Gly Gly Gly Phe Asp Glu Ala Lys Leu Ala Glu Ala Arg Glu 

35 40 45 

Met lie Arg Tyr Met Ala Leu Val Ser Ala Met Glu Arg Thr Glu His 

50 55 60 

Lys Ala 
65 

<210> 18 
<211> 904 
<212> PRT 

<213> Herpes simplex virus 



<400> 18 



Met 


Arg 


Gly 


Gly 


Gly 


Leu 


He Cys Ala 


Leu Val Val 


Gly Ala Leu Val 


1 








5 






10 


• 15 


Ala 


Ala 


Val 


Ala 


Ser 


Ala 


Ala Pro Ala 


Ala Pro Ala 


Ala Pro Arg Ala 








20 






25 




30 


Ser 


Gly 


Gly 


Val 


Ala 


Ala 


Thr Val Ala 


Ala Asn Gly 


Gly Pro Ala Ser 






35 








40 




45 


Arg 


Pro 


Pro 


Pro 


Val 


Pro 


Ser Pro Ala 


Thr Thr Lys 


Ala Arg Lys Arg 




50 










55 


60 


Lys 


Thr 


Lys 


Lys 


Pro 


Pro 


Lys Arg Pro 


Glu Ala Thr 


Pro Pro Pro Asp 


65 










70 




75 


80 


Ala 


Asn 


Ala 


Thr 


Val 


Ala 


Ala Gly His 


Ala Thr Leu 


Arg Ala His Leu 










85 






90 


95 


Arg 


Glu 


He 


Lys 


Val 


Glu 


Asn Ala Asp 


Ala Gin Phe 


Tyr Val Cys Pro 








100 






105 




110 


Pro 


Pro 


Thr 


Gly 


Ala 


Thr 


Val Val Gin 


Phe Glu Gin 


Pro Arg Arg Cys 






115 








120 




125 


Pro 


Thr 


Arg 


Pro 


Glu 


Gly 


Gin Asn Tyr 


Thr Glu Gly 


He Ala Val Val 




130 










135 


140 




Phe 


Lys 


Glu 


Asn 


He 


Ala 


Pro Tyr Lys 


Phe Lys Ala 


Thr Met Tyr Tyr 


145 










150 




155 


160 


Lys 


Asp 


Val 


Thr 


Val 


Ser 


Gin Val Trp 


Phe Gly His 


Arg Tyr Ser Gin 










165 






170 


175 


Phe 


Met 


Gly 


He 


Phe 


Glu 


Asp Arg Ala 


Pro Val Pro 


Phe Glu Glu Val 








180 






185 




190 


lie 


Asp 


Lys 


He 


Asn 


Thr 


Lys Gly Val 


Cys Arg Ser 


Thr Ala Lys Tyr 






195 








200 




205 


Val 


Arg 


Asn 


Asn 


Met 


Glu 


Thr Thr Ala 


Phe His Arg 


Asp Asp His Glu 




210 










215 


220 




Thr 


Asp 


Met 


Glu 


Leu 


Lys 


Pro Ala Lys 


Val Ala Thr 


Arg Thr Ser Arg 


225 










230 




235 


240 


Gly 


Trp 


His 


Thr 


Thr 


Asp 


Leu Lys Tyr 


Asn Pro Ser 


Arg Val Glu Ala 










245 






250 


255 


Phe 


His 


Arg 


Tyr 


Gly 


Thr 


Thr Val Asn 


Cys He Val 


Glu Glu Val Asp 



WO 02/02131 



PCTYUS01/20981 



12 



260 



Al a Am 


Ser 


Val 


Tvr Pro Tvr Asn 

J. y 1 L 1 \J J. J J- £XLJ £S 




97 R 




980 


Php Val 
rriic val 


fur 

iyr 


rJe L. 


Qqt prn Pho Tvr 
ocx nu tilt; i y x> 










Gin H-i <? 


Thr 

11 IX 


OCX 


Tvr 2il s Ala A <?n 
lyi fiia nj.a nop 










Tyr Ala 


Arg 


Asp 


Lpu Thr Thr Lvs 








0£«J 


riiy noli 


Leu. 


Leu 


Thr Thr Pro T.i/<3 
xllX x 11X r i.U l>yo 










irxo Jjys 


Arg 


Pro 


Ala Ua 1 Pxrc Th r 
ni a. Val vyo xllX 




jOO 




OOVJ 


Glu Mpf 

VJ1 U. UC7 I— 


XJCU 


A m 


Al a Gl ii Tvr Gl v 


O f \J 








Til a Tl <a 
nla lie 


Ser 


1 III. 


Thr* Pho Thr Thy* 
xlll Irllc 1111 111X 


ODj 








niy vai 


Asp 


Leu 


vaxy nop t»y& lie 








4nR 


2\ c;o Am 


Met 


Phe 


Al a. Arn T.vs Tvr 






AOO 




_ 

Gin Pro 




Tyr 


T\/r T.on Zil a Thr* 

iyx jjcu rixa nix 




A ^ R 




440 


Pyn T^OIl 


Leu. 


Oar 

oex 


noil 1111 lieu nla 


1 JU 








Am Gl n 
nl y Ul LI 


Gl n 


Asp 


Attt T .ire Pro Arn 
nx y Liy o nu niy 








4 70 


ulU nla 


Pro 


Ser 


7\ 1 —3 TV nn 71 1 0 Car 
nla nSIl nXa. uci 








485 


Qoy Tl o 

ocl JL XC 


gi n 

blU 


Pho 

■Tl lc 


Al s 7A rrr T.on Gl n 
nl d niy JjcU ulll 






jUU 




Hie Val 
nlS Val 


Asn 


Asp 


riet lieu oxy Aiy 










Ulll noil 


nis 


n ii 

UlU 


T on fViy T an I'm 

iicu inr X16U up 










Ben Jil a 
noil riJ. d 


Tl o 

IXC 


Ala 


Cor- al a Thr Val 
OCX nla llix v al 










lieu uiy 


Asp 


17-1 1 


Mot 1 ZX.1 a 17 a 1 Qar 
rJcL nla Val Del 










noii vai 


Tl o 

lie 


V clX 


fllT >-» 7A cn Car MoT" 
l7lll noil Dei HcL 






3 D V 




r*v*5 Tx/r 


Car 
OCX 


Am 

ajl y 


Prn T.^n Val S^r 

tlU XlCLl Val uCJL 








DUU 


T.dn Tl a 
XiCU lie 


gi n 

V3l u 


(11 XT 

vjxy 


fll n T.on Hi! x/ (21 11 

ulll JLlCU wl_y V31U 








fil R 


Asp Ala 


Leu 


Glu 


Pro Cys Thr Val 


625 






630 


Gly Gly 


Gly 


Tyr 


Val Tyr Phe Glu 








645 


Ser Arg 


Ala 


Asp 


Val Thr Thr Val 






660 




Thr Met 


Leu 


Glu 


Asp His Glu Phe 




675 




680 


His Glu 


lie 


Lys 


Asp Ser Gly Leu 


690 






695 


Arg Asn 


Gin 


Leu 


His Asp Leu Arg 


705 






710 


Arg Ala 


Asp 


Ala 


Asn Ala Ala Met 



265 


270 






Glu Phe Val 


Lp»u Ala Thr 

ueu nxa 1 111 


Gly 


Asp 




285 






Glv Tvr Arcr 

wJLjr 1 j 1 m y 


(11 n m v Cpr 

VJl KJL oiy OCX 


nio 


Thr 

X 111 




300 






Arcr Phe» Lvq 

y (tic xj y 0 


Gin Val Asn 
vjiii v a 1 noy 


Gl v 
vjxy 


Php 

filC 


315 






~-J u 


Ala Arcr Ala 


Thr Spr Pro 

X11X OCX 1 J- W 


Thr 

1 iii 


Thr 

x 11X 


330 




335 




Php Thr Val 
iriic 1 iix veil 


Ala Tm Aqt* 
nxa lip nop 


Tm 

irp 


Val 
V al 


345 


350 






Mot Thr TjV«3 


Trr> (21 ri (21 n 
up 0111 UlU 


Val 
v al 


A cn 
nop 




365 






fll u C or Ph<=i 
01 _y oci t no 


Arrr Pho 9or 
my nic ocx 


Oar 

OCX 


Asp 




380 






A*5n TiPii Thr 
noil jucu jLin 


Gl 11 Tvr Rpr 

V31U 1 jl uCl 


ucu 


^or 

OCl 


395 






400 


fil v Arcr A^o 

vji _y ni y nop 


Al a Arcr Gl u 
ni a niy uiu 


Al a 

ma 


Tip 
xxc 


410 




415 




A^sn Al a Thr 

noil <TTk Ci X11JL 


1110 11c u y 0 


Val 
v al 


Gl v 
uiy 


425 


430 






Glv Glv Phe 

vj_i_^y x lie 


TiPn Tip Ala 

ucu xie iii ci 


Tvr 

iyr 


Gl n 

Oil 1 




445 






Glu Lpu Tvr 

vjj>u lieu 1 jr <i. 


Va 1 Arn (21 11 

vai m y eiu 


Tvr 

iyr 


Mph 




460 






A*?ri Al a Thr 
non nia xnx 


Pro Al a Pro 
xriu m a c 1 u 


T 

i»eu 


- 

Arg 


475 






480 


Val Glu Arcr 

v ax vj j. Lt nj> y 


Tip T.vc; Th r 
xxc uyo xiix 


Thr 
1 111 


Cor 
OCX 


4 90 




*3 ^ J 




Pho Thr Tvr 
r 11c xiix 1 _y j_ 


Acri Hi « Tl 0 
noli £1X0 xxc 


(21 n 


Arg 


505 


510 






Tie Ala Val 

11<3 JTaJLu V cn 


Ala Tro Pv*? 
nxd up uyo 


CI n 

UlU 


T.on 
UC U 




525 






7\ on Cll n 2\ 1 a 
noil ulU nla 


Arf*f T.1/C T.on 
niy ij_y 0 iicU 


Asn 


pm 
riO 




540 






dl v Arcr Arcr 
uiy ni y ni y 


Val Qor Ala 
v ai OCX ni a 


Arrr 
nx y 


Mpt 


555 






JuU 


Thr Pv<! Val 

1 11 JL uyo vai 


Pro Va 1 Ala 
1 x vai ni a 


Prn 

tX U 


7\ cn 
nop 


570 




R7S 




Arn Val Ser 

ni y vai >— > ~ 1 


Cor Am Pro 
ocx nx y nu 


(21 v 
uiy 


Thr 

X 111 


585 


590 






Phe Arcr Tvr 

x ne nj. y 1 _y 1 


Glu A«50 (21 n 
viu nop 0111 


Gl v 
uiy 


Pm 

JTX 




605 






Z\ cn A*5n C21 n 
noii noii uiii 


T«pii Am T.011 
xicu nx y xjcu 


Thr 

X 111 


Arg 




620 






Glv Ht<? Arcr 

ui _y 1110 ni y 


Am 1 Tx/r Pho 
nx y xyx xiic 


T1p 
lie 


Pho 
xlle 


635 






640 


Glu Tyr Ala 


Tyr Ser His 


Gin 


Leu 


650 




655 




Ser Thr Phe 


lie Asp Leu 


Asn 


He 


665 


670 






Val Pro Leu 


Glu Val Tyr 


Thr 


Arg 




685 






Leu Asp Tyr 


Thr Glu Val 


Gin 


Arg 




700 






Phe Ala Asp 


lie Asp Thr 


Val 


He 


715 






720 


Phe Ala Gly 


Leu Cys Ala 


Phe 


Phe 
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725 










730 


735 


PI n 

VjX U 


uiy 




740 




T.on 
Xie u, 






Al a 
nla 

745 


Val Rl v 


T lV q Val Val Met Glv 
750 


Val 
V d-L 


Val 


755 


pi v 


V ct_L 


Val 

V Cl-L 


Car 


AT a 

760 


Val 
v al 


oex vaxy 


Val Q<=» t* Qpr Phe Met 

Vol OCX OCX C liC 

765 


Ser 


Asn 
110 


Pro 


cue 


Pi \7 

<axy 


Al a 

riXd 


Leu 
77^ 


AT a 


Val 


Pi \7 T.on 

uiy xie u 


T.on Val T.on Ala Pi \r 
ilCU VcLX Xie LI nld kJJ_y 

7fin 

(OU 


T 

Leu 


v dl 


rila 


7\ "I a 
nld 


.rile 


IT IlC 


Al a 


tile 


Arg 


T wr* Va 1 
± yi v dx 


T.on Pi n T.on PI n Attt 

UCU uJ.il llCli OX.ll r\±. y 


785 










790 








795 


800 


Asn 






Lys 


nld 

805 


Leu 


lyr 


•c X (J 


Leu 


JLUX 11 IX. 

810 


T.vb Pi n T.on T.vq Thr 
UjfO V7XLI Jjc u J-i_y O XI IX. 

815 




Asp 


Pro 


pi w 
820 


ijxy 


Val 


uxy 


(11 X r 

vjj.y 


PI n 

uJLU 

825 


PI \/ Pin 


Pin Pi w Ala Pin PI \r 
ulU uly nld ulU uiy 

830 


Gly 


Gly 


Phe 
835 


Asp 


Glu 


Ala 


Lys 


Leu 
840 


Ala 


Glu Ala 


Arg Glu Met lie Arg 
845 


Tyr 


Met 
850 


Ala 


Leu 


Val 


Ser 


Ala 
855 


Met 


Glu 


Arg Thr 


Glu His Lys Ala Arg 
860 


Lys 


Lys 


Gly 


Thr 


Ser 


Ala 


Leu 


Leu 


Ser 


Ser Lys 


Val Thr Asn Met Val 


865 










870 








875 


880 


Leu 


Arg 


Lys 


Arg 


Asn 
885 


Lys 


Ala 


Arg 


Tyr 


Ser Pro 
890 


Leu His Asn Glu Asp 
895 


Glu 


Ala 


Gly 


Asp 
900 


Glu 


Asp 


Glu 


Leu 









<210> 19 
<211> 443 
<212> DNA 

<213> Herpes simplex virus 
<400> 19 

ccctctccca caeggteggt gccccccatc tctgtttcat catcgtcccg gttgcgttgc 60 
gctttccggc cctcccgcac ccccgcgttc cggtgtctcg cggcccggcg ccatgatcac 120 
ggattgtttc gaagcagaca tcgcgatccc ctegggtate tcgcgccccg atgccgcggc 180 
getgeagegg tgcgagggtc gagtggtctt tctgccgacc atccgccgcc agetggeget 240 
cgcggacgtg gcgcacgaat cgttcgtctc eggaggagtt agtcccgaca cgttggggtt 300 
gttgctggcg taccgcaggc gcttccccgc ggtaatcacg cgggtgctgc ccacgcgaat 360 
cgtcgcctgc cccgtggacc tggggctcac gcacgccggc accgtcaatc tccgcaacac 420 
ctcccccgtc gacctctgca acg 443 

<210> 20 
<211> 37 
<212> PRT 

<213> Herpes simplex virus 
<400> 20 

Pro Leu Pro His Gly Arg Cys Pro Pro Ser Leu Phe His His Arg Pro 

15 10 15 

Gly Cys Val Ala Leu Ser Gly Pro Pro Ala Pro Pro Arg Ser Gly Val 

20 25 30 

Ser Arg Pro Gly Ala 
35 

<210> 21 
<211> 147 
<212> PRT 

<213> Herpes simplex virus 



<400> 21 
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PCT/US01/20981 

14 



Pro 


Leu 


Pro 


His 


Gly 


Arg 


Cys 


Pro Pro Ser Leu Phe His His Arg Pro 


1 








5 






10 15 


Gly 


Cys 


Val 


Ala 
20 


Leu 


Ser 


Gly 


Pro Pro Ala Pro Pro Arg Ser Gly Val 
25 30 


Ser 


Arg 


Pro 
35 


Gly 


Ala 


Met 


He 


Thr Asp Cys Phe Glu Ala Asp He Ala 
40 45 


He 


Pro 


Ser 


Gly 


He 


Ser 


Arg Pro Asp Ala Ala Ala Leu Gin Arg Cys 




50 










55 


60 


Glu 


Gly 


Arg 


Val 


Val 


Phe 


Leu 


Pro Thr He Arg Arg Gin Leu Ala Leu 


65 










70 




75 80 


Ala 


Asp 


Val 


Ala 


His 
85 


Glu 


Ser 


Phe Val Ser Gly Gly Val Ser Pro Asp 
90 95 


Thr 


Leu 


Gly 


Leu 
100 


Leu 


Leu 


Ala 


Tyr Arg Arg Arg Phe Pro Ala Val He 
105 HO 


Thr 


Arg 


Val 


Leu 


Pro 


Thr 


Arg He Val Ala Cys Pro Val Asp Leu Gly 






115 










120 125 


Leu 


Thr 
130 


His 


Ala 


Gly 


Thr 


Val 
135 


Asn Leu Arg Asn Thr Ser Pro Val Asp 
140 


Leu 


Cys 


Asn 













145 



<210> 22 
<211> 110 
<212> PRT 

<213> Herpes simplex virus 



<400> 22 



Met He 


Thr 


Asp 


Cys 


Phe 


Glu Ala 


1 






5 






Ser Arg 


Pro 


Asp 


Ala 


Ala 


Ala Leu 






20 








Phe Leu 


Pro 


Thr 


He 


Arg 


Arg Gin 




35 








40 


Glu Ser 


Phe 


Val 


Ser 


Gly 


Gly Val 


50 










55 


Leu Ala 


Tyr 


Arg 


Arg 


Arg 


Phe Pro 


65 








70 




Thr Arg 


He 


Val 


Ala 


Cys 


Pro Val 








85 






Thr Val 


Asn 


Leu 


Arg 


Asn 


Thr Ser 






100 







Asp lie Ala He Pro Ser Gly He 

10 15 
Gin Arg Cys Glu Gly Arg Val Val 
25 30 
Leu Ala Leu Ala Asp Val Ala His 
45 

Ser Pro Asp Thr Leu Gly Leu Leu 
60 

Ala Val He Thr Arg Val Leu Pro 

75 80 
Asp Leu Gly Leu Thr His Ala Gly 

90 95 
Pro Val Asp Leu Cys Asn 
105 HO 



<210> 23 
<211> 318 
<212> PRT 

<213> Herpes simplex virus 



<400> 23 

Met He Thr Asp Cys Phe Glu Ala Asp He Ala He Pro Ser Gly He 

15 10 15 

Ser Arg Pro Asp Ala Ala Ala Leu Gin Arg Cys Glu Gly Arg Val Val 

20 25 " 30 

Phe Leu Pro Thr He Arg Arg Gin Leu Ala Leu Ala Asp Val Ala His 

35 40 45 

Glu Ser Phe Val Ser Gly Gly Val Ser Pro Asp Thr Leu Gly Leu Leu 

50 55 60 

Leu Ala Tyr Arg Arg Arg Phe Pro Ala Val He Thr Arg Val Leu Pro 
65 70 75 80 
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Thr 


Ara 


lie Val Ala Cys 


Pro Val Asn T.en Gl v 


Tien Thr Hi c Ala v 
ucu i in. nxo ni ci ui y 






85 


90 


95 


Thr 


Val 


Asn Leu Aro Asn 


Thr Ser Pro Val flqn 


T.on C\r<3 Sen CZ 1 \r Asn 
Ucu L»yo Aoll uiy Ao^ 






100 


105 


1 1 fi 
11U 


Pro 


Val 


Ser Leu Val Pro 

»^S31 VI V Hi i. 1 \-/ 


Pro Val Phe Gin Glv 


r;l n Ala Thr A so Val 

V3X11 Ala XIli AokJ V cLX 






115 


1 90 


IPS 


Arg 


Leu 


Gl n Ser T.en Asn 


T.on Thr* T.on Avrr PVio 
UCU J. Ill Ucu AJL y 1: 11G 


Dm Vs* 1 Dm T an "Dm 
tiO Vdl triO LcU JriO 




130 




135 


140 


Thr 


Pro 


Leu AT a Arrr fil n 


Tip Val fll a Byrr T.^n 
■Lie V dJL nl cl Al y UCU 


Val Ala A-rrr CI \/ Tl p 
vo.1 Aid Aiy oiy Xic 


145 




150 


-j -j 


lOU 


Arg 


Asp 


Lpu Asn Pro Asn 


Pro Arrr Thr Prn CZ~\ w 
t iu nxy iiii nu vji_y 


CI n T.on Dm Ben T e»n 
ulU ucu .riO ao£> LcU 






165 


170 


1 7S 

J. / -J 


Asn 


Val 


Leu Tvr Tvt Asn 


Cl \/ Al a Arrr T.on Qor 

Vjljf Aid niy UCU OCX, 


T.mi Val Ala Acn Val 
Ucu v d J- Aid nop Vcl_L 






180 


185 


1 Qfi 

17U 


Gin 


Gin 


Leu A 1a Ser Val 

x*ia wci v cii 


Asn TVir fil n T.on Arrr 
11 iiii uj. u j_i ci u ni y 


Ocl Ucu Vdl Ucll noil 






195 


200 


205 


Met 


Val 


Tvr Ser lie Thr 


Gin Glv Thr Thr T.en 

vXU Ulj Ull 1111 llCU 


Tl o TiOn THr T-an T 1 o 
ixc ucu nix. Ucu lie 




210 




215 


220 


Pro 


Arg 


Leti Leu Al a Tien 


Ser Ala Rln Asn CZl \r 
oci r\±. a uXil Ao ly OXJy 


T\7"K* Val flon 7\ 1 -a T n*-> 
lyi Veil nSIi Md jjeu 


225 




230 


235 


240 


Leu 


Gin 


Met Gin Ser Val 


Thr Arg Glu Ala Ala 


Gin Leu lie His Pro 






245 


250 


255 


Glu 


Ala 


Pro Met Leu Met 


Gin Asp Gly Glu Arg 


Arg Leu Pro Leu Tyr 






260 


265 


270 


Glu 


Ala 


Leu Val Ala Trp 


Leu Ala His Ala Gly 


Gin Leu Gly Asp lie 






275 


280 


285 


Leu 


Ala 


Leu Ala Pro Ala 


Val Arg Val Cys Thr 


Phe Asp Gly Ala Ala 




290 




295 


300 


Val 


Val 


Gin Ser Gly Asp 


Met Ala Pro Val lie 


Arg Tyr Pro 


305 




310 


315 





<210> 24 
<211> 502 
<212> DNA 

<213> Herpes simplex virus 
<400> 24 

actgttgtag gggggaaaac acagttcegg gaaggcgttt attgeggaga gaggggggaa 60 
agaaagagaa acaaaagaaa eggcaagaaa gactcaagac gtgcgcgtga teggaaaaaa 120 
ggccgggggg atcccggtcg gggccgccag gtaaatggcc atgatgaccg cgaccatgag 180 
gtcgtccgcg gcaccgttgc gttttccgga gtacatgegg acgtcggtgt tgggagagac 240 
ggtttcgatg aggttgttga getgetegga cagatactcg acegggtegg tetgeaggeg 300 
caccgtcacg gagacgagct cctgggacgc catgacgccc ccggagttga actttttgat 360 
aaagtattcg aaggcgggcg tcttctgttt gttgagcaga aagaaggggt acaataccgc 420 
gccgccgggc ggctcgcagt gatagaagag gagcteggge cccgggccgt tggcccccgc 480 
egaggecagg atgcggtgca tc 502 

<210> 25 
<211> 135 
<212> PRT 

<213> Herpes simplex virus 
<400> 25 

Met His Arg lie Leu Ala Ser Ala Gly Ala Asn Gly Pro Gly Pro Glu 

1 5 10 15 

Leu Leu Phe Tyr His Cys Glu Pro Pro Gly Gly Ala Val Leu Tyr Pro 

20 25 30 

Phe Phe Leu Leu Asn Lys Gin Lys Thr Pro Ala Phe Glu Tyr Phe lie 
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35 


40 






45 






Lvs 


Lvs 


Phe Asn Ser Gly Gly 


Val 


Met 


Ala 


Ser Gin Glu Leu 


Val 


Ser 




50 


55 








60 






Val 


Thr 


Val Arg Leu Gin Thr 


Asp 


Pro 


Val 


Glu Tyr Leu Ser 


Glu 


Gin 


65 




70 








75 




80 


Leu 


Asn 


Asn Leu He Glu Thr 


Val 


Ser 


Pro 


Asn Thr Asp Val 


Arg Met 






85 






90 




95 




Tyr 


Ser 


Gly Lys Arg Asn Gly 


Ala 


Ala Asp 


Asp Leu Met Val 


Ala 


Val 






100 




105 




110 






He 


Met 


Ala He Tyr Leu Ala 


Ala 


Pro 


Thr 


Gly He Pro Pro 


Ala 


Phe 






115 


120 






125 






Phe 


Pro 


He Thr Arg Thr Ser 
















130 


135 















<210> 26 
<211> 734 
<212> PRT 

<213> Herpes simplex virus 
<400> 26 



Met 


Phe 


Gly 


Gin 


Gin 


■Leu 


Ala 


Ser 


Asp Val 


Gin Gin Tyr Leu 


Glu Arg 


1 








5 










10 




15 




Leu 


Glu 


Lvs 


Gin 
20 


Arq 


Gin 


Gin 


Lvs 

Jr w 


Val 
25 


Gly 


Val Asp Glu Ala 
30 


Ser 


Ala 


Glv 


Leu 


Thr 


Leu 


Glv 


Glv 


Asp 


Ala 


Leu Arg 


Val Pro Phe Leu 


Asp 


Phe 






35 










40 






45 






Ala 


Thr 


Ala 


Thr 


Pro 




Arg 


His 


Gin 


Thr 


Val Val Pro fil v 


Val Gly 




50 










55 








60 






Thr 


Leu 


His 


Asp 


Cys 


Cys 


Glu 


His 


Ser 


Pro 


Leu Phe Ser Ala 


Val 


Ala 


65 










70 










75 




80 


Arg 


Arg 


Leu 


Leu 


Phe 


Asn 


Ser 


Leu 


Val 


Pro 


Ala Gin Leu Arg 


Gly Arg 










85 










90 




95 




Asp 


Phe 


Gly 


Gly 
100 


Asp 


His 


Thr 


Ala 


Lys 
105 


Leu 


Glu Phe Leu Ala 
110 


Pro 


Glu 


Leu 


Val 


Arg 
115 


Ala 


Val 


Ala 


Arg 


Leu 
120 


Arg 


Phe 


Arg Glu Cys Ala 
125 


Pro 


Glu 


Asp 


Ala 
130 


Val 


Pro 


Gin 


Arg 


Asn 
135 


Ala 


Tyr 


Tyr 


Ser Val Leu Asn 
140 


Thr 


Phe 


Gin 


Ala 


Leu 


His 


Arg 


Ser 


Glu 


Ala 


Phe Arg 


Gin Leu Val His 


Phe 


Val 


145 










150 










155 




160 


Arg 


Asp 


Phe 


Ala 


Gin 
165 


Leu 


Leu 


Lys 


Thr 


Ser 
170 


Phe Arg Ala Ser 


Ser 
175 


Leu 


Ala 


Glu 


Thr 


Thr 
180 


Gly 


Pro 


Pro 


Lys 


Lys 
185 


Arg 


Ala Lys Val Asp 
190 


Val 


Ala 


Thr 


His 


Gly 
195 


Gin 


Thr 


Tyr 


Gly 


Thr 
200 


Leu 


Glu 


Leu Phe Gin Lys 
205 


Met 


He 


Leu 


Met 
210 


His 


Ala 


Thr 


Tyr 


Phe 
215 


Leu 


Ala 


Ala 


Val Leu Leu Gly 
220 


Asp 


His 


Ala 


Glu 


Gin 


Val 


Asn 


Thr 


Phe 


Leu 


Arg 


Leu 


Val Phe Glu He 


Pro 


Leu 


225 










230 










235 




240 


Phe 


Ser 


Asp 


Thr 


Ala 


Val 


Arg 


His 


Phe Arg 


Gin Arg Ala Thr 


Val 


Phe 










245 










250 




255 




Leu 


Val 


Pro 


Arg 
260 


Arg 


Bis 


Gly 


Lys 


Thr 
265 


Trp 


Phe Leu Val Pro 
270 


Leu 


He 


Ala 


Leu 


Ser 
275 


Leu 


Ala 


Ser 


Phe 


Arg 
280 


Gly 


He 


Lys He Gly Tyr 
285 


Thr 


Ala 


His 


He 
290 


Arg 


Lys 


Ala 


Thr 


Glu 
295 


Pro 


Val 


Phe 


Asp Glu He Asp 
300 


Ala 


Cys 
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Leu 


Arg 


Gly Trp Phe Gly Ser 


Ser Arg Val Asp 


His Val Lys Gly Glu 


305 




310 


315 


320 


Thr 


He 


Ser Phe Ser Phe Pro 


Asp Gly Ser Arg 


Ser Thr He Val Phe 






325 


330 


335 


Ala 


Ser 


Ser His Asn Thr Asn 


Gly He Arg Gly 


Gin Asp Phe Asn Leu 






340 


345 


350 


Leu 


Phe 


Val Asp Glu Ala Asn 


Phe He Arg Pro 


Asp Ala Val Gin Thr 






355 


360 


365 


lie 


Met 


Gly Phe Leu Asn Gin 


Ala Asn Cys Lys 


He He Phe Val Ser 




370 


375 




380 


Ser 


Thr 


Asn Thr Gly Lys Ala 


Ser Thr Ser Phe 


Leu Tyr Asn Leu Arg 


385 




390 


395 


400 


Gly 


Ala 


Ala Asp Glu Leu Leu 


Asn Val Val Thr 


Tyr lie Cys Asp Asp 






405 


410 


415 


His 


Met 


Pro Arg Val Val Thr 


His Thr Asn Ala 


Thr Ala Cys Ser Cys 






420 


425 


430 


Tyr 


He 


Leu Asn Lys Pro Val 


Phe He Thr Met 


Asp Gly Ala Val Arg 






435 


440 


445 


Arg 


Thr 


Ala Asp Leu Phe Leu 


Pro Asp Ser Phe 


Met Gin Glu He He 




450 


455 




460 


Gly 
-* 


Gly 


Gin Ala Arg Glu Thr 


Gly Asp Asp Arg 


Pro Val Leu Thr Lys 


465 




470 


475 


480 


Ser 


Ala 


Gly Glu Arg Phe Leu 


Leu Tyr Arg Pro 


Ser Thr Thr Thr Asn 






485 


490 


4 95 


Ser 


Gly 


Leu Met Ala Pro Glu 


Leu Tyr Val Tyr 


Val Asp Pro Ala Phe 






500 


505 


510 


Thr 


Ala 


Asn Thr Arg Ala Ser 


Gly Thr Gly He 


Ala Val Val Gly Arg 






515 


520 


525 


Tvr 


Ara 


Asp Asp Phe He He 


Phe Ala Leu Glu 


His Phe Phe Leu Arg 




530 


535 




540 


Ala 


Leu 


Thr Gly Ser Ala Pro 


Ala Asp He Ala 


Arg Cys Val Val His 


545 




550 


555 


560 


Ser 


Leu 


Ala Gin Val Leu Ala 


Leu His Pro Gly 


Ala Phe Arg Ser Val 






565 


570 


575 


Ara 


Val 


Ala Val Glu Gly Asn 


Ser Ser Gin Asp 


Ser Ala Val Ala He 






580 


585 


590 


Ala 


Thr 


His Val His Thr Glu 


Met His Arg He 


Leu Ala Ser Ala Gly 






595 


600 


605 


Ala 


Asn 


Gly Pro Gly Pro Glu 


Leu Leu Phe Tyr 


His Cys Glu Pro Pro 




610 


615 




620 


Gly 


Gly 


Ala Val Leu Tyr Pro 


Phe Phe Leu Leu 


Asn Lys Gin Lys Thr 


625 




630 


635 


640 


Pro 


Ala 


Phe Glu Tyr Phe He 


Lys Lys Phe Asn 


Ser Gly Gly Val Met 


Ala 




645 


650 


655 


Ser 


Gin Glu Leu Val Ser 


Val Thr Val Arg 


Leu Gin Thr Asp Pro 






660 


665 


670 


Val 


Glu 


Tyr Leu Ser Glu Gin 


Leu Asn Asn Leu 


He Glu Thr Val Ser 






675 


680 


685 


Pro 


Asn 


Thr Asp Val Arg Met 


Tyr Ser Gly Lys 


Arg Asn Gly Ala Ala 




690 


695 




700 


Asp 


Asp 


Leu Met Val Ala Val 


He Met Ala He 


Tyr Leu Ala Ala Pro 


705 




710 


715 


720 


Thr 


Gly 


lie Pro Pro Ala Phe 


Phe Pro He Thr 


Arg Thr Ser 






725 


730 





<210> 27 
<211> 15 
<212> PRT 
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<213> HSV-2 
<400> 27 

Gly Arg Val Tyr Glu Glu lie Pro Trp Val Arg Val Tyr Glu Asn 
5 10 15 



<210> 28 
<211> 15 
<212> PRT 
<213> HSV-2 

<400> 28 

Tyr Glu Asn lie Cys Leu Arg Arg Gin Asp Ala Gly Gly Ala Ala 
5 10 15 



<210> 29 
<211> 15 
<212> PRT 
<213> HSV-2 

<400> 29 

Pro Asp Ser Pro Tyr lie Glu Ala Glu Asn Pro Leu Tyr Asp Trp 
5 10 15 



<210> 30 
<211> 15 
<212> PRT 
<213> HSV-2 

<400> 30 

Tyr lie Glu Ala Glu Asn Pro Leu Tyr Asp Trp Gly Gly Ser Ala 
5 10 15 



<210> 31 
<211> 15 
<212> PRT 
<213> HSV-2 

<400> 31 

Thr Asn Ala Leu Ala Asn Asp Gly Pro Thr Asn Val Ala Ala Leu 
5 10 15 



<210> 32 
<211> 15 
<212> PRT 
<213> HSV-2 

<400> 32 

Arg Val Leu Pro Thr Arg lie Val Ala Cys Pro Val Asp Leu Gly 
5 10 15 



<210> 33 
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<211> 15 
<212> PRT 
<213> HSV-2 

<400> 33 

Thr Arg lie Val Ala Cys Pro Val Asp Leu Gly Leu Thr His Ala 
5 10 15 



<210> 34 
<211> 661 
<212> DNA 
<213> HSV-2 



<400> 34 

ctcctcttcc gcctcctcct cctcctcttc 
ctctgcgggc ggggctggtg ggagcgtcgc 
aacctccctc ggcccccgcg ctgctgcgcc 
gcgccacgcg gagggcggcc ccgagcccgg 
ctacctgccc atcgcggggg tctcgagcgt 
ggtcacgggg gactgcctgc ccgtcctgga 
ggtcctcgtg gaccagacgg ggaacgtggc 
gagccgccgc accctgctcc ccgagcacgc 
gacgcccccc gcgtcggagt ggaacagcct 
tgaccagggc accctggtgg gcgcgctgga 
gtctcgggag cagggcgcgc ccgcgccggc 

g 



cgcctcctcc tcctcctcct ccgcctcttc 60 
gtccgcgtcc ggcgctgggg agagacgaga 120 
gcgggggccg aggaagtgtg ccaggaagac 180 
ggcccgcgac ccggcgcccg gcctcacgcg 240 
cgtggccctg gcgccttacg tgaacaagac 300 
catggagacg ggccacatag gggcctacgt 360 
ggacctgctg cgggccgcgg cccccgcgtg 420 
gcgcaactgc gtgaggcccc ccgactaccc 480 
ctggatgacc ccggtgggca acatgctctt 540 
cttccacggc ctccggtcgc gccacccgtg 600 
cggcgacgcc cccgcgggcc acggggagta 660 

661 



<210> 35 
<211> 2481 
<212> DNA 
<213> HSV-2 



<400> 35 

atggaacccc ggcccggcac gagctcccgg 
cagacccccg gcacgcagcc cgccgccccg 
tggctcgcca gcagcgactc ggaggaggag 
caccgcgact ccacctccga ggcgggcagc 
atggacgcgg ccacgccccc ggcccggccc 
gccgacgcgc agggatcctg tgggggtggg 
ggggggggcg acgtgtgtgc cgtgtgcacg 
agttttccct gcctgcaccc cttctgcatc 
aacacgtgtc ccctgtgcaa caccccggtg 
gggtcgttca gcaccatccc gatagtgaac 
gccgtgcggg ccggcacggc cgtggacttt 
cgctccctgt cgctgggggg acacacggtc 
ggcacggacg acgaggacga tgacctggcc 
cgaagagcgc cccggcgcgg gggcggcggt 
gccgcgaccc gaccggcgcc ccctggcgcc 
ttgcgggcgg gggtgggatc tgggtctggg 
agagtggcct ctcttccccc tgcggccggc 
gaagacgccg cggcggcgga gggcaggacg 
gagcccccca tagtcatcag cgactctccc 
gggccgctct cctttgtctc ctcctcctcc 
ggtctgccac agtcgtcggg gcgcgccgcg 
cggagtccgc cccgcgccgc cgccgccccc 
cccgcgccgc ccgccgtgcc ggtggacgcg 
gctcagaccg acacccaagc acagagtctg 



gcggaccccg gccccgagcg gccgccgcgg 60 
cacgcctggg ggatgctcaa cgacatgcag 120 
accgaggtgg gaatctctga cgacgacctt 180 
acggacacgg agatgttcga ggcgggcctg 240 
ccggccgagc gccagggcag ccccacgccc 300 
cccgtgggtg aggaggaagc ggaagcggga 360 
gacgagatcg ccccgcccct gcgctgccag 420 
ccgtgcatga agacctggat tccgttgcgc 480 
gcgtacctga tagtgggcgt gaccgccagc 540 
gacccccgga cccgcgtgga ggccgaggcg 600 
atctggacgg gcaacccgcg gacggccccg 660 
cgcgccctgt cgcccacccc cccgtggccc 720 
gacggtgtgg actacgtccc gcccgccccc 780 
gcgggggcga cccgcggaac ctcccagccc 840 
ccgcggagca gcagcagcgg cggcgccccg 900 
ggcggccctg ccgtcgcggc cgtcgtgccg 960 
ggggggcgcg cgcaggcgcg gcgggtgggc 1020 
ccccccgcga gacagccccg cgcggcccag 1080 
ccgccgtctc cgcgccgccc cgcgggcccc 1140 
gcacaggtgt cctcgggccc cgggggggga 1200 
cgcccccgcg cggccgtcgc cccgcgcgtc 1260 
gtggtgtctg cgagcgcgga cgcggccggg 1320 
caccgcgcgc cccggtcgcg catgacccag 1380 
ggccgggcag gcgcgaccga cgcgcgcggg 1440 
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tcgggagggc cgggcgcgga gggaggaccc 
gccgcccccc acgccgcgga gggggcggcg 
tcgggccccg cggcctcgtc ctccgcctct 
ccccaggggg tgggggccaa gagggcggcg 
gaccgcggcc acgggccgct cgccccggcg 
ccgtcgtccc aggccgcggt cgccgccgcc 
tcctcctcct ccgcctcctc ctcctccgcc 
tcctcctcct ccgcctcttc ctctgcgggc 
ggcgctgggg agagacgaga aacctccctc 
aggaagtgtg ccaggaagac gcgccacgcg 
ccggcgcccg gcctcacgcg ctacctgccc 
gcgccttacg tgaacaagac ggtcacgggg 
ggccacatag gggcctacgt ggtcctcgtg 
cgggccgcgg cccccgcgtg gagccgccgc 
gtgaggcccc ccgactaccc gacgcccccc 
ccggtgggca acatgctctt tgaccagggc 
ctccggtcgc gccacccgtg gtctcgggag 
cccgcgggcc acggggagta g 

<210> 36 
<211> 1603 
<212> DNA 
<213> HSV-2 



ggggtccccc gcggcaccaa cacccccggt 1500 

gcccgccccc ggaagaggcg cgggtcggac 1560 

tcctccgccg ccccgcgctc gcccctcgcc 1620 

ccgcgccggg ccccggactc ggactcgggc 1680 

tccgcgggcg ccgcgccccc gtcggcgtct 1740 

tcctcctcct ccgcctcctc ctcctccgcc 1800 

tcctcctcct ccgcctcctc ctcctccgcc 1860 

ggggctggtg ggagcgtcgc gtccgcgtcc 1920 

ggcccccgcg ctgctgcgcc gcgggggccg 1980 

gagggcggcc ccgagcccgg ggcccgcgac 2040 

atcgcggggg tctcgagcgt cgtggccctg 2100 

gactgcctgc ccgtcctgga catggagacg 2160 

gaccagacgg ggaacgtggc ggacctgctg 2220 

accctgctcc ccgagcacgc gcgcaactgc 2280 

gcgtcggagt ggaacagcct ctggatgacc 2340 

accctggtgg gcgcgctgga cttccacggc 2400 

cagggcgcgc ccgcgccggc cggcgacgcc 2460 

2481 



<400> 36 

cggccggagg gctgtcccgc atcgatatca 
cgcgcacgag cgcggcgtcg cacccgaacg 
gccacgtcga ggccgacggg gagaggtaca 
tggcggccag ccaggtcacg gatgcgttgt 
gatgcctcgg tgtccccgcg ggcggccccg 
gcctctcggt cgccccgtcg tctccccgcg 
cgcgaatgtt acccaggccc gggaccgcaa 
tcccttgagc tgggtcacct cggcgggggg 
gggcaggctc gcgttccgag gccggccgag 
cccggggtcc cgaggaaact cggccgtggt 
ccaccgaagc atgggcacgg atgccgtagt 
tccgcgctgg gggtcgctgg cgtcgggggt 
gtcgcgccag tacgtgcggg ccaaccccag 
cgtccagtac gtgtcatggc ccccggcaac 
accggggccc atggtcgtcc gtcccggctg 
cccacggccc aaagaggctc cgcacctcgg 
cgcgggccaa aggatggggt ggggtgaggg 
acaaggacgg atcggcccgc tccgtttttc 
ccccgaaccc ccgcgcttat ggacacacca 
ttattgccgt catcgccggg aggccttccg 
cccacctcgc gggcaaacgt gcgcgccagg 
gtgacgcggg ttgggatcat cccggcggtg 
ggcgactgat cgtaatccag gacaaataga 
tccaagaccc aggcaaaaat gtggtacaag 
cggaacaggg caaacagcgt gtcctcgatg 
tcggggcgcg gggtcgccgc ggcgaccccc 
ccgccgcgct gcaggtaccg caccgtgttg 

<210> 37 
<211> 1131 
<212> DNA 
<213> HSV-2 



cgagccccat gaagcccttc ccgtatcgcg 60 
ccagcccgcc cgtcgtccag acgcccacgg 120 
cgtaccgacc cggagtccgt agcaggcccc 180 
gcagatgcgc gatgctcagg ttcgtcgtcg 240 
ggggcggcgc gttgcgtcgg ccgtccgggt 300 
ggaacgtaag cccctcgcgg tccgcgcggc 360 
cagcgcggag gcgccggggt tgtgcgacag 420 
acgggacgtg ggccccgcct cggggagctc 480 
cagataggtc tttgggatgt aaagcagctg 540 
gaccaacacg aaacaaaagc gctcggcgta 600 
caggttgagt tcgcccgggg gcgccaagcg 660 
tgttgggcaa ccacagacgc ccggtgtttt 720 
accgtgcaaa aaccacgggt cgatttgctc 780 
gcccaccagg acccccatca ccacccacag 840 
ccagtccgca gatggggggg ggtgtccgta 900 
aggctatcgg aggccctttg ttgccgtaag 960 
taaaagcaca aagggagtac cagaccgaaa 1020 
ggtggggtgc tgatacggtg ccagccctgg 1080 
cacgacaaca atgcctttta ttctgttctt 1140 
ttcgggcttc cgtgtttgaa ctaaactccc 1200 
tcgcgtatct cggcgatgga cccggcggtt 1260 
aggcgcaaca gggcgtctcg acacccgacg 1320 
tgcatcggaa ggaggcggtc ggccaagacg 1380 
tccccgttgg gggccagcag ctcgggaacg 1440 
cggggcagag accccgcgcc gtcctcgggg 1500 
gtcagccggc cccagtcctc ccgccacctc 1560 
gcgagtagat cgt 1603 
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<400> 37 

atggcttctc acgccggcca acagcacgcg cctgcgttcg gtcaggctgc tcgtgcgagc 60 
gggcctaccg acggccgc^c ggcgtcccgt cctagccatc gccagggggc ctccggagcc 120 
cgcggggatc cggagctgcc cacgctgctg cgggtttata tagacggacc ccacggggtg 180 
gggaagacca ccacctccgc gcagctgatg gaggccctgg ggccgcgcga caatatcgtc 240 
tacgtccccg agccgatgac ttactggcag gtgctggggg cctccgagac cctgacgaac 300 
atctacaaca cgcagcaccg tctggaccgc ggcgagatat cggccgggga ggcggcggtg 360 
gtaatgacca gcgcccagat aacaatgagc acgccttatg cggcgacgga cgccgttttg 420 
gctcctcata tcggggggga ggctgtgggc ccgcaagccc cgcccccggc cctcaccctt 480 
gttttcgacc ggcaccctat cgcctccctg ctgtgctacc cggccgcgcg gtacctcatg 540 
ggaagcatga ccccccaggc cgtgttggcg ttcgtggccc tcatgccccc gaccgcgc'cc 600 
ggcacgaacc tggtcctggg tgtccttccg gaggccgaac acgccgaccg cctggccaga 660 
cgccaacgcc cgggcgagcg gcttgacctg gccatgctgt ccgccattcg ccgtgtctac 720 
gatctactcg ccaacacggt gcggtacctg cagcgcggcg ggaggtggcg ggaggactgg 780 
ggccggctga cgggggtcgc cgcggcgacc ccgcgccccg accccgagga cggcgcgggg 840 
tctctgcccc gcatcgagga cacgctgttt gccctgttcc gcgttcccga gctgctggcc 900 
cccaacgggg acttgtacca catttttgcc tgggtcttgg acgtcttggc cgaccgcctc 960 
cttccgatgc atctatttgt cctggattac gatcagtcgc ccgtcgggtg tcgagacgcc 1020 
ctgttgcgcc tcaccgccgg gatgatccca acccgcgtca caaccgccgg gtccatcgcc 1080 
gagatacgcg acctggcgcg cacgtttgcc cgcgaggtgg ggggagttta g 1131 

<210> 38 
<211> 2517 
<212> DNA 
<213> HSV-2 

<400> 38 

atgggccccg gtctgtgggt ggtgatgggg gtcctggtgg gcgttgccgg gggccatgac 60 
acgtactgga cggagcaaat cgacccgtgg tttttgcacg gtctggggtt ggcccgcacg 120 
tactggcgcg acacaaacac cgggcgtctg tggttgccca acacccccga cgccagcgac 180 
ccccagcgcg gacgcttggc gcccccgggc gaactcaacc tgactacggc atccgtgccc 24 0 
atgcttcggt ggtacgccga gcgcttttgt ttcgtgttgg tcaccacggc cgagtttcct 300 
cgggaccccg ggcagctgct ttacatccca aagacctatc tgctcggccg gcctcggaac 360 
gcgagcctgc ccgagctccc cgaggcgggg cccacgtccc gtccccccgc cgaggtgacc 420 
cagctcaagg gactgtcgca caaccccggc gcctccgcgc tgttgcggtc ccgggcctgg 480 
gtaacattcg cggccgcgcc ggaccgcgag gggcttacgt tcccgcgggg agacgacggg 540 
gcgacc'gaga ggcacccgga cggccgacgc aacgcgccgc ccccggggcc gcccgcgggg 600 
acaccgaggc atccgacgac gaacctgagc atcgcgcatc tgcacaacgc atccgtgacc 660 
tggctggccg ccaggggcct gctacggact ccgggtcggt acgtgtacct ctccccgtcg 720 
gcctcgacgt ggcccgtggg cgtctggacg acgggcgggc tggcgttcgg gtgcgacgcc 780 
gcgctcgtgc gcgcgcgata cgggaagggc ttcatggggc tcgtgatatc gatgcgggac 840 
agccctccgg ccgagatcat agtggtgcct gcggacaaga ccctcgctcg ggtcggaaat 900 
ccgaccgacg aaaacgcccc cgcggtgctc cccgggcctc cggccggccc caggtatcgc 960 
gtctttgtcc tgggggcccc gacgcccgcc gacaacggct cggcgctgga cgccctccgg 1020 
cgggtggccg gctaccccga ggagagcacg aactacgccc agtatatgtc gcgggcctat 1080 
gcggagtttt tgggggagga cccgggctcc ggcacggacg cgcgtccgtc cctgttctgg 1140 
cgcctcgcgg ggctgctcgc ctcgtcgggg tttgcgttcg tcaacgcggc ccacgcccac 1200 
gacgcgattc gcctctccga cctgctgggc tttttggccc actcgcgcgt gctggccggc 1260 
ctggccgccc ggggagcagc gggctgcgcg gccgactcgg tgttcctgaa cgtgtccgtg 1320 
ttggacccgg cggcccgcct gcggctggag gcgcgcctcg ggcatctggt ggccgcgatc 1380 
ctcgagcgag agcagagcct ggtggcgcac gcgctgggct atcagctggc gttcgtgttg 1440 
gacagccccg cggcctatgg cgcggtggcc ccgagcgcgg cccgcctgat cgacgccctg 1500 
tacgccgagt ttctcggcgg ccgcgcgcta accgccccga tggtccgccg agcgctgttt 1560 
tacgccacgg ccgtcctccg ggcgccgttc ctggcgggcg cgccctcggc cgagcagcgg 1620 
gaacgcgccc gccggggcct cctcataacc acggccctgt gtacgtccga cgtcgccgcg 1680 
gcgacccacg ccgatctccg ggccgcgcta gccaggaccg accaccagaa aaacctcttc 1740 
tggctcccgg accacttttc cccatgcgca gcttccctgc gcttcgatct cgccgagggc 1800 
gggttcatcc tggacgcgct ggccatggcc acccgatccg acatcccggc ggacgtcatg 1860 
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gcacaacaga 
atccgcgcct 
ccccggatcc 
ccccgcggga 
tatctcaccg 
accgaaaacc 
ggggaggtca 
gggccggtgg 
ttgttcccca 
atcgcccccg 
gcgggcatcc 

<210> 39 
<211> 376 ' 
<212> PRT 
<213> HSV-2 



cccgcggcgt 
tcgtcccgga 
tcgtgcccat 
tcggatacaa 
ccacctgcga 
ggcgcgacct 
tgtcggtgct 
cgggcacccc 
acggaactgt 
ggtttctggc 
ttagggtggt 



ggcctccgtt 
ggccacccac 
cacccacaac 
gcttacgggc 
agggcacgcg 
cggcctcgtg 
gctggtggac 
gaacgtgttt 
gattcatctg 
cgcgtccgcg 
ccgaacgtgc 



ctcacgcgct 
cagtgtagcg 
gccagctacg 
gttgacgtcc 
cgggagattg 
ggggccgtgt 
acggatgcca 
tccagcgacg 
ctggcctttg 
ctgggggtcg 
gtcccatttt 



gggcgcacta 
gcccgtcgca 
tcgtcaccca 
gccgcccgct 
agccgaagcg 
ttctgcgcta 
cccaacagca 
tgccgtccgt 
acacgctgcc 
ttatgattac 
tgtggagacg 



caacgccctg 
caacgcggag 
cacccccttg 
gtttatcacc 
gctggtgcgc 
caccccggcc 
gctggcccag 
ggccctgttg 
catcgccacc 
cgcggccctg 
cgaataa 



1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2517 



<400> 39 



Met Ala 


Ser 


His 


Ala 


Glv 


Gin 


Gin 


His 


Ala 


Pro 


Ala 


Phe 


Glv Gin 


Ala 








5 










10 








15 




Ala Arg 


Ala 


Ser 


Glv 
y 


Pro 


Thr 


Asp 


Glv 


Arg 


Ala 


Ala 


Ser 


Arg Pro 


Ser 






20 










25 










30 




His Arg 


Gin 


Glv 


Ala 


Ser 


Glv 


Ala 


Arg 


Glv 
y 


Asp 


Pro 


Glu 


Leu Pt*o 


Thr 




35 










40 










45 






Leu Leu 


Arg 


Val 


Tvr 


He 


Asp 


Glv 


Pro 


His 


Glv 
y 


Val 


Gly 


Lys Thr 


Thr 


50* 










55 










60 










Ala 


Gin 


Leu 


Met 


Glu 


Ala 


Leu 


Glv 


Pro 


Arg 


Asp 




V ctl 


65 








70 










75 








80 


Tyr Val 


Pro 


Glu 


Pro 


Met 


Thr 


Tvr 
y 


Trp 


Gin 


Val 


Leu 


Gly 




Glu 








85 










90 








95 




Thr Leu 


Thr 


Asn 


He 


Tyr 


Asn 


Thr 


Gin 


His 


Arg 


Leu 


Asp 


Arg Gly 


Glu 






100 










105 










110 




lie Ser 


Ala 


Gly 


Glu 


Ala 


Ala 


Val 


Val 


Met 


Thr 


Ser 


Ala 


Gin He 


Thr 




115 










120 










125 






Met Ser 


Thr 


Pro 


Tyr 


Ala 


Ala 


Thr 


Asp 


Ala 


Val 


Leu 


Ala 


Pro His 


He 


130 










135 










140 








Gly Gly 


Glu 


Ala 


Val 


Gly 


Pro 


Gin 


Ala 


Pro 


Pro 


Pro 


Ala 


Leu Thr 


Leu 


145 








150 










155 








160 


Val Phe 


Asp 


Arg 


His 


Pro 


He 


Ala 


Ser 


Leu 


Leu 


Cys 


Tyr 


Pro Ala 


Ala 








165 










170 








175 




Arg Tyr 


Leu 


Met 


Gly 


Ser 


Met 


Thr 


Pro 


Gin 


Ala 


Val 


Leu 


Ala Phe 


Val- 






180 










185 










190 




Ala Leu 


Met 


Pro 


Pro 


Thr 


Ala 


Pro 


Gly 


Thr 


Asn 


Leu 


Val 


Leu Gly 


val 




195 










200 










205 






Leu Pro 


Glu 


Ala 


Glu 


His 


Ala 


Asp 


Arg 


Leu 


Ala 


Arg 


Arg 


Gin Arg 


Pro 


210 










215 










220 








Gly Glu 


Arg 


Leu 


Asp 


Leu 


Ala 


Met 


Leu 


Ser 


Ala 


He 


Arg 


Arg Val 


Tyr 


225 








230 










235 








240 


Asp Leu 


Leu 


Ala 


Asn 


Thr 


Val 


Arg 


Tyr 


Leu 


Gin 


Arg 


Gly 


Gly Arg 


Trp 








245 










250 








255 




Arg Glu 


Asp 


Trp 


Gly 


Arg 


Leu 


Thr 


Gly 


Val 


Ala 


Ala 


Ala 


Thr Pro 


Arg 






260 










265 










270 




Pro Asp 


Pro 


Glu 


Asp 


Gly 


Ala 


Gly 


Ser 


Leu 


Pro 


Arg 


He 


Glu Asp 


Thr 




275 










280 










285 






Leu Phe 


Ala 


Leu 


Phe 


Arg 


Val 


Pro 


Glu 


Leu 


Leu 


Ala 


Pro 


Asn Gly 


Asp 


290 










295 










300 








Leu Tyr 


His 


lie 


Phe 


Ala 


Trp 


Val 


Leu 


Asp 


Val 


Leu 


Ala 


Asp Arg 


Leu 


305 








310 








315 








320 
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Leu Pro Met His Leu Phe Val Leu 
325 

Cys Arg Asp Ala Leu Leu Arg Leu 
340 

Val Thr Thr Ala Gly Ser He Ala 
355 360 
Phe Ala Arg Glu Val Gly Gly Val 
370 375 



Asp Tyr Asp Gin Ser Pro Val Gly 

330 ~ 335 
Thr Ala Gly Met He Pro Thr Arg 
345 350 
Glu He Arg Asp Leu Ala Arg Thr 
365 



<210> 40 
<211> 136 
<212> PRT 
<213> HSV-2 



<400> 40 




Asp 


Leu 


Leu 


Ala Asn Thr Val Arg 
5 


Arg 


Glu 


Asp 


Trp Gly Arg Leu Thr 








20 


Pro 


Asp 


Pro 


Glu Asp Gly Ala Gly 






35 


40 


Leu 


Phe 


Ala 


Leu Phe Arg Val Pro 




50 




55 


Leu 


Tyr 


His 


He Phe Ala Trp Val 


65 






70 


Leu 


Pro 


Met 


His Leu Phe Val Leu 








85 


Cys 


Arg 


Asp 


Ala Leu Leu Arg Leu 








100 


Val 


Thr 


Thr 


Ala Gly Ser He Ala 






115 


120 


Phe 


Ala 


Arg 


Glu Val Gly Gly Val 




130 




135 



<210> 41 
<211> 284 
<212> PRT 
<213> HSV-2 



Tyr 


Leu Gin Arg Gly 


Gly Arg Trp 




10 




Gly 


Val Ala Ala Ala 


Thr Pro Arg 


25 




30 


Ser 


Leu Pro Arg He 


Glu Asp Thr 




45 




Glu 


Leu Leu Ala Pro 


Asn Gly Asp 




60 




Leu 


Asp Val Leu Ala 


Asp Arg Leu 




75 


80 


Asp 


Tyr Asp Gin Ser 


Pro Val Gly 




90 


95 


Thr 


Ala Gly Met He 


Pro Thr Arg 


105 




110 


Glu 


He Arg Asp Leu 


Ala Arg Thr 




125 





<400> 41 




Met 


Gly 


Pro 


Gly Leu Trp Val Val 
5 


Gly 


Gly 


His 


Asp Thr Tyr Trp Thr 








20 


His 


Gly 


Leu 


Gly Leu Ala Arg Thr 






35 


40 


Arg 


Leu 


Trp 


Leu Pro Asn Thr Pro 




50 




55 


Arg 


Leu 


Ala 


Pro Pro Gly Glu Leu 


65 






70 


Met 


Leu 


Arg 


Trp Tyr Ala Glu Arg 








85 


Ala 


Glu 


Phe 


Pro Arg Asp Pro Gly 








100 


Tyr 


Leu 


Leu 


Gly Arg Pro Arg Asn 






115 


120 


Ala 


Gly 


Pro 


Thr Ser Arg Pro Pro 



Met 


Gly Val Leu Val 


Gly Val Ala 




10 


15 


Glu 


Gin He Asp Pro 


Trp Phe Leu 


25 




30 


Tyr 


Trp Arg Asp Thr 


Asn Thr Gly 




45 




Asp 


Ala Ser Asp Pro 


Gin Arg Gly 




60 




Asn 


Leu Thr Thr Ala 


Ser Val Pro 




75 


80 


Phe 


Cys Phe Val Leu 


Val Thr Thr 




90 


95 


Gin 


Leu Leu Tyr He 


Pro Lys Thr 


105 




110 


Ala 


Ser Leu Pro Glu 


Leu Pro Glu 




125 




Ala 


Glu Val Thr Gin 


Leu Lys Gly 
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130 
Leu Ser His 
145 

Val Thr Phe 

Gly Asp Asp 

Pro Pro Pro 
195 

Leu Ser He 

210 
Arg Gly Leu 
225 

Ala Ser Thr 

Gly Cys Asp 

Gly Leu Val 
275 

<210> 42 
<211> 15 
<212> PRT 
<213> HSV-2 

<400> 42 

Ser Leu Pro Arg lie Glu Asp Thr Leu Phe Ala Leu Phe Arg Val 
5 10 15 



<210> 43 
<211> 15 
<212> PRT 
<213> HSV-2 

<400> 43 

Gly Ser He Ala Glu He Arg Asp Leu Ala Arg Thr Phe Ala Arg 
5 10 15 



<210> 44 
<211> 16 
<212> PRT 
<213> HSV-2 

<400> 44 

Glu He Arg Asp Leu Ala Arg Thr Phe Ala Arg Glu Val Gly Gly Val 
5 10 15 

<210> 45 
<211> 838 
<212> PRT 
<213> HSV-2 

<400> 45 
Met Gly Pro 

Gly Gly His 



Asn Pro 

Ala Ala 
165 
Gly Ala 
180 

Gly Pro 

Ala His 

Leu Arg 

Trp Pro 
245 
Ala Ala 
260 

He Ser 



135 
Gly Ala Ser 
150 

Ala Pro Asp 

Thr Glu Arg 

Pro Ala Gly 
200 

Leu His Asn 

215 
Thr Pro Gly 
230 

Val Gly Val 

Leu Val Arg 

Met Arg Asp 
280 



Ala Leu 

Arg Glu 
170 
His Pro 
185 

Thr Pro 

Ala Ser 

Arg Tyr 

Trp Thr 
250 
Ala Arg 
265 

Ser Pro 



140 
Leu Arg 
155 

Gly Leu 

Asp Gly 

Arg His 

Val Thr 
220 
Val Tyr 
235 

Thr Gly 
Tyr Gly 
Pro Ala 



Ser Arg 

Thr Phe 

Arg Arg 
190 
Pro Thr 
205 

Trp Leu 

Leu Ser 

Gly Leu 

Lys Gly 
270 



Ala Trp 
160 
Pro Arg 
175 

Asn Ala 

Thr Asn 

Ala Ala 

Pro Ser 
240 
Ala Phe 
255 

Phe Met 



Gly Leu Trp Val Val Met Gly Val Leu Val Gly Val Ala 

5 10 15 

Asp Thr Tyr Trp Thr Glu Gin He Asp Pro Trp Phe Leu 
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25 



20 



His Gly 


Leu 


Gly Leu 




35 




Arg Leu 


Trt) 


Leu Pro 


50 






Arg Leu 


Ala 


Pro Pro 


65 






Met Leu 


Arg 


Trp Tyr 






85 


Ala Glu 


Phe 


Pro Arg 






100 


Tyr Leu 


Leu 


Gly Arg 




115 




Ala Gly 


Pro 


Thr Ser 


130 






Leu Ser 


His 


Asn Pro 


145 






Val Thr 


Phe 


Ala Ala 






165 


Gly Asp 


Asp 


Gly Ala 






180 


Pro Pro 


Pro 


Gly Pro 




195 




Leu Ser 


lie 


Ala His 


210 






Arg Gly 


Leu 


Leu Arg 


225 






Ala Ser 


Thr 


Trp Pro 






245 


Glv Cvs 


Asp 


Ala Ala 






260 


Gly Leu 


Val 


He Ser 




275 




Val Pro 


Ala 


Ast> Lvs 


290 






Asn Ala 


Pro 


Ala Val 


305 






Val Phe 


Val 


Leu Gly 






325 


Asp Ala 


Leu 


Arg Arg 






340 


Ala Gin 


Tvr 


Met Ser 




355 




Gly Ser 


Glv 


Thr Asp 


370 






Leu Leu 


Ala 


Ser Ser 


385 






Asp Ala 


He 


Arg Leu 






405 


Val Leu 


Ala 


Gly Leu 






420 


Ser Val 


Phe 


Leu Asn 




435 




Leu Glu 


Ala 


Arg Leu 


450 






Gin Ser 


Leu 


Val Ala 


4 65 






Asp Ser 


Pro 


Ala Ala 



25 30 
Ala Arg Thr Tyr Trp Arg Asp Thr Asn Thr Gly 

40 45 
Asn Thr Pro Asp Ala Ser Asp Pro Gin Arg Gly 

55 60 
Gly Glu Leu Asn Leu Thr Thr Ala Ser Val Pro 
70 75 80 

Ala Glu Arg Phe Cys Phe Val Leu Val Thr Thr 

90 95 
Asp Pro Gly Gin Leu Leu Tyr He Pro Lys Thr 

105 110 
Pro Arg Asn Ala Ser Leu Pro Glu Leu Pro Glu 

120 125 
Arg Pro Pro Ala Glu Val Thr Gin Leu Lys Gly 

135 140 
Gly Ala Ser Ala Leu Leu Arg Ser Arg Ala Trp 
150 155 ~ 160 

Ala Pro Asp Arg Glu Gly Leu Thr Phe Pro Arg 

170 175 
Thr Glu Arg His Pro Asp Gly Arg Arg Asn Ala 

185 190 
Pro Ala Gly Thr Pro Arg His Pro Thr Thr Asn 

200 205 
Leu His Asn Ala Ser Val Thr Trp Leu Ala Ala 

215 220 
Thr Pro Gly Arg Tyr Val Tyr Leu Ser Pro Ser 
230 235 240 

Val Gly Val Trp Thr Thr Gly Gly Leu Ala Phe 

250 255 
Leu Val Arg Ala Arg Tyr Gly Lys Gly Phe Met 

265 270 
Met Arg Asp Ser Pro Pro Ala Glu He He Val 

280 285 
Thr Leu Ala Arg Val Gly Asn Pro Thr Asp Glu 

295 300 
Leu Pro Gly Pro Pro Ala Gly Pro Arg Tyr Arg 
310 315 320 

Ala Pro Thr Pro Ala Asp Asn Gly Ser Ala Leu 

330 335 
Val Ala Gly Tyr Pro Glu Glu Ser Thr Asn Tyr 

345 350 
Arg Ala Tyr Ala Glu Phe Leu Gly Glu Asp Pro 

360 365 
Ala Arg Pro Ser Leu Phe Trp Arg Leu Ala Gly 

375 380 
Gly Phe Ala Phe Val Asn Ala Ala His Ala His 
390 395 400 

Ser Asp Leu Leu Gly Phe Leu Ala His Ser Arg 

410 415 
Ala Ala Arg Gly Ala Ala Gly Cys Ala Ala Asp 

425 * 430 

Val Ser Val Leu Asp Pro Ala Ala Arg Leu Arg 

440 445 
Gly His Leu Val Ala Ala He Leu Glu Arg Glu 

455 460 
His Ala Leu Gly Tyr Gin Leu Ala Phe Val Leu 
470 475 480 

Tyr Gly Ala Val Ala Pro Ser Ala Ala Arg Leu 
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485 










490 495 




lie Asp 


Ala 


Leu 


Tyr 


Ala 


Glu 


Phe 


Leu 


Gly Gly Arg Ala Leu Thr 


Ala 






500 










505 


510 




Pro Met 


Val 


Arg 


Arg 


Ala 


Leu 


Phe 


Tyr 


Ala Thr Ala Val Leu Arg 


Ala 




515 










520 




525 




Pro Phe 


Leu 


Ala 


Gly 


Ala 


Pro 


Ser 


Ala 


Glu Gin Arg Glu Arg Ala 


Arg 


530 










535 






540 


Arg Gly 


Leu 


Leu 


He 


Thr 


Thr 


Ala 


Leu 


Cys Thr Ser Asp Val Ala 


Ala 


545 








550 








555 


560 


Ala Thr 


His 


Ala 


Asp 


Leu 


Ara 


Ala 


Ala 


Leu Ala Arg Thr Asp His 


Gin 








565 










570 575 




Lys Asn 


Leu 


Phe 


Trp 


Leu 


Pro 


Asp 


His 


Phe Ser Pro Cvs Ala Ala 


Ser 






580 










585 


590 




Leu Arg 


Phe 


Asp 


Leu 


Ala 


Glu 


Gly 


Gly 


Phe He Leu Asp Ala Leu 


Ala 




595 










600 




605 




Met Ala 


Thr 


Arg 


Ser 


Asp 


He 


Pro 


Ala 


Asp Val Met Ala Gin Gin 


Thr 


610 










615 






620 




Arg Gly 


Val 


Ala 


Ser 


Val 


Leu 


Thr 


Arg 


Trp Ala His Tyr Asn Ala 


Leu 


625 








630 








635 


640 


lie Arg 


Ala 


Phe 


Val 


Pro 


Glu 


Ala 


Thr 


His Gin Cys Ser Gly Pro 


Ser 








645 










650 655 




His Asn 


Ala 


Glu 


Pro 


Arg 


He 


Leu 


Val 


Pro He Thr His Asn Ala 


Ser 






660 










665 


670 




Tyr Val 


Val 


Thr 


His 


Thr 


Pro 


Leu 


Pro 


Arg Gly He Gly Tyr Lys 


Leu 




675 










680 




685 




Thr Gly 


Val 


Asp 


Val 


Arcr 


Arq 


Pro 


Leu 


Phe He Thr Tyr Leu Thr 


Ala 


690 










695 






700 




Thr Cys 


Glu 


Gly 


His 


Ala 


Arg 


Glu 


He 


Glu Pro Lys Arg Leu Val 


Arg 


705 








710 








715 


720 


Thr Glu 


Asn 


Ara 


Arq 


Asp 


Leu 


Gly 


Leu 


Val Gly Ala Val Phe Leu 


Arg 








725 










730 735 


Tyr Thr 


Pro 


Ala 


Gly 


Glu 


Val 


Met 


Ser 


Val Leu Leu Val Aso Thr* 


Asp 






740 










745 


750 


Ala Thr 


Gin 


Gin 


Gin 


Leu 


Ala 


Gin 


Gly 


Pro Val Ala Gly Thr Pro 


Asn 




755 










760 




765 




Val Phe 


Ser 


Ser 


Asp 


Val 


Pro 


Ser 


Val 


Ala L^n l\i=»n Tit*n PTie* Prn 


Aon 


770 










775 






780 




Gly Thr 


Val 


He 


His 


Leu 


Leu 


Ala 


Phe 


Asd Th:r Leu Pro He Ala 


Thr 


785 








790 








795 


800 


He Ala 


Pro 


Gly 


Phe 


Leu 


Ala 


Ala 


Ser 


Ala Leu Gly Val Val Met 


He 








805 










810 815 




Thr Ala 


Ala 


Leu 


Ala 


Gly 


He 


Leu 


Arq 


Val Val Arg Thr Cys Val 


Pro 






820 










825 


830 




Phe Leu 


Trp 


Arg 


Arg 


Glu 














835 


















<210> 46 


















<211> 215 


















<212> PRT 


















<213> HSV-2 


















<400> 46 


















Ser Ser 


Ser 


Ala 


Ser 


Ser 


Ser 


Ser 


Ala 


Ser Ser Ser Ser Ala Ser 


Ser 








5 










10 15 




Ser Ala Gly 


Gly 


Ala 


Gly 


Gly 


Ser 


Val 


Ala Ser Ala Ser Gly Ala 


Gly 






20 










25 


30 


Glu Arg Arg 


Glu 


Thr 


Ser 


Leu 


Gly 


Pro 


Arg Ala Ala Ala Pro Arg 


Gly 




35 










40 




45 
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Pro 


Arcr 


Lvs 


Cys 


Ala Arg Lys Thr Arg 




50 






55 


Pro 


Glv 


Ala 


Arg 


Asp Pro Ala Pro Gly 


65 








70 


Ala 


Glv 


Val 


Ser 


Ser Val Val Ala Leu 










85 


Val 


Thr 


Glv 


Asp 


Cvs Lpu Pto Val T.pii 








100 


105 


Gly 


Ala 


Tvr 


Val 


Val T.pii Val Aqd Gl n 

VOX UCU VOX ngy vJ.il 






115 




120 


Leu 


Arg 


Ala 


Al A 


AT a Prn AT a Trn Ca r 
fU.a rx u nia up Oci 




130 






135 


His 


Ala 


Arg 


Asn 


Cys Val Arg Pro Pro 


145 








150 


Ser 


Glu 


Trp 


Asn 


Ser Leu Trp Met Thr 










165 


Asp 


Gin 


Gly 


Thr 


Leu Val Gly Ala Leu 








180 


185 


Arg 


His 


Pro 


Trp 


Ser Arg Glu Gin Gly 






195 




200 


Ala 


Pro 


Ala 


Gly 


His Gly Glu 




210 






215 



<210> 47 
<211> 826 
<212> PRT 
<213> HSV-2 



<400> 47 



Met 


Glu 


Pro 


Arg 


Pro Gly Thr Ser Ser 
5 


Arg 


Pro 


Pro 


Arg 


Gin Thr Pro Gly Thr 








20 


25 


Trp 


Gly 


Met 


Leu 


Asn Asp Met Gin Trp 






35 




40 


Glu 


Glu 


Thr 


Glu 


Val Gly He Ser Asp 




50 






55 


Thr 


Ser 


Glu 


Ala 


Gly Ser Thr Asp Thr 


65 








70 


Met 


Asp 


Ala 


Ala 


Thr Pro Pro Ala Arg 










85 


Ser 


Pro 


Thr 


Pro 


Ala Asp Ala Gin Gly 








100 


105. 


Gly 


Glu 


Glu 


Glu 


Ala Glu Ala Gly Gly 






115 




120 


Cys 


Thr 


Asp 


Glu 


He Ala Pro Pro Leu 




130 






135 


Leu 


His 


Pro 


Phe 


Cys He Pro Cys Met 


145 








150 


Asn 


Thr 


Cys 


Pro 


Leu Cys Asn Thr Pro 










165 


Val 


Thr 


Ala 


Ser 


Gly Ser Phe Ser Thr 








180 


185 


Arg 


Thr 


Arg 


Val 


Glu Ala Glu Ala Ala 






195 




200 


Asp 


Phe 


He 


Trp 


Thr Gly Asn Pro Arg 




210 






215 



PCT/US01/20981 
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His 


Ala Glu Glv Glv Pro 


Glu 




60 




Leu 


ThT* Arcr Tvt Ti^u Pto 


He 




75 ' 


80 


Ala 


Prn Tvt* Val Asn Ti\/<^ 
nu x y x. vdJ- noil xjyo 


Thr 

X XiX 


90 


95 




Zi or-> 

nO^J 


Mpf Rln Thr HI v W-i «3 
ncL vjxu jl iix uiy nio 


Tl a 

X JL fcl 




110 




Thr 


Gly Asn Val Ala Asp 


Leu 




125 




Arg 


Arg Thr Leu Leu Pro 


Glu 




140 




Asp 


Tyr Pro Thr Pro Pro 


Ala 




155 


160 


Pro 


Val Gly Asn Met Leu 


Phe 


170 


175 




Asp 


Phe His Gly Leu Arg 


Ser 




190 




Ala 


Pro Ala Pro Ala Gly 


Asp 




205 




Arg 


AT a Aon Prn fZ"\ \r *D 
nj. ca nop it x kJ uXj JZJLU 


LI 


10 


15 




Gin 


D-rn Ala Ala Prr> H-? <a 
t JL v nxa nXd JTIU ilJLo 


Al a 
nxa 




30 




Leu 


fij- a ocx ocx no£»> ocx 


CI n 








21 on 

no^j 


nop lieu nxo nl y nop 


Ser 




60 




Glu 


Met" Php> filn Ala Civ 
i j cl rue oiu nXa v?JLy 


Leu 




75 


AO 


Pro 


PTC* Ala fil n Arrr m r> 
^ nxa uiu nxy V3_l_ll 


ni w 

uxy 


90 


95 




Ser 


Cvs Glv Glv d \/ Prn 


Val 

V ai 




110 




Gly 


Glv Asn Val rv<? Ala 


Val 
v ax 




125 




Arg 


Cys Gin Ser Phe Pro 


Cys 




140 




Lys 


Thr Trp He Pro Leu 


Arg 




155 


160 


Val 


Ala Tyr Leu He Val 


Gly 


170 


175 




He 


Pro He Val Asn Asp 


Pro 




190 




Val 


Arg Ala Gly Thr Ala 


Val 




205 




Thr 


Ala Pro Arg Ser Leu 
•?9n 


Ser 



220 
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Leu 


Gly 


Gly 


His 


Thr Val Arg Ala Leu Ser 


Pro Thr Pro Pro Trp Pro 


225 








230 


235 240 


Gly 


Thr 


Asp 


Asp 


Glu Asp Asp Asp Leu Ala 


Asp Gly Val Asp Tyr Val 










245 250 


255 


Pro 


Pro 


Ala 


Pro 


Arg Arg Ala Pro Arg Arg 


Gly Gly Gly Gly Ala Gly 








260 


265 


270 


Ala 


Thr 


Arg 


Gly 


Thr Ser Gin Pro Ala Ala 


Thr Arg Pro Ala Pro Pro 






275 




280 


285 


Gly 


Ala 


Pro 


Arg 


Ser Ser Ser Ser Gly Gly 


Ala Pro Leu Arg Ala Gly 




290 






295 


300 


Val 


Gly 


Ser 


Gly 


Ser Gly Gly Gly Pro Ala 


Val Ala Ala Val Val Pro 


305 








310 


315 320 


Arg 


Val 


Ala 


Ser 


Leu Pro Pro Ala Ala Gly 


Gly Gly Arg Ala Gin Ala 










325 330 


335 


Arg 


Arg 


Val 


Gly 


Glu Asp Ala Ala Ala Ala 


Glu Gly Arg Thr Pro Pro 








340 


345 


350 


Ala 


Arg 


Gin 


Pro 


Arg Ala Ala Gin Glu Pro 


Pro lie Val lie Ser Asp 






355 




360 


365 


Ser 


Pro 


Pro 


Pro 


Ser Pro Arg Arg Pro Ala 


Gly Pro Gly Pro Leu Ser 




370 






375 


380 


Phe 


Val 


Ser 


Ser 


Ser Ser Ala Gin Val Ser 


Ser Gly Pro Gly Gly Gly 


385 








390 


395 400 


Gly 


Leu 


Pro 


Gin 


Ser Ser Gly Arg Ala Ala 


Arg Pro Arg Ala Ala Val 










405 "* 410 


415 


Ala 


Pro 


Arg 


Val 


Arg Ser Pro Pro Arg Ala 


Ala Ala Ala Pro Val Val 








420 


425 


430 


Ser 


Ala 


Ser 


Ala 


Asp Ala Ala Gly Pro Ala 


Pro Pro Ala Val Pro Val 






435 




440 


445 


Asp 


Ala 


His 


Arg 


Ala Pro Arg Ser Arg Met 


Thr Gin Ala Gin Thr Asp 




450 






455 


460 


Thr 


Gin 


Ala 


Gin 


Ser Leu Gly Arg Ala Gly 


Ala Thr Asp Ala Arg Gly 


465 








470 


475 480 


Ser 


Gly 


Gly 


Pro 


Gly Ala Glu Gly Gly Pro 


Gly Val Pro Arg Gly Thr 










485 490 


495 


Asn 


Thr 


Pro 


Gly 


Ala Ala Pro His Ala Ala 


Glu Gly Ala Ala Ala Arg 








500 


505 


510 


Pro 


Arq 


Lvs 


Arg 


Arg Gly Ser Asp Ser Gly 


Pro Ala Ala Ser Ser Ser 






515 




520 


525 


Ala 


Ser 


Ser 


Ser 


Ala Ala Pro Arg Ser Pro 


Leu Ala Pro Gin Gly Val 




530 






535 


540 


Gly 


Ala 


Lys 


Arg 


Ala Ala Pro Arg Arg Ala 


Pro Asp Ser Asp Ser Gly 


545 








550 


555 560 


Asp 


Arg 


Gly 


His 


Gly Pro Leu Ala Pro Ala 


Ser Ala Gly Ala Ala Pro 










565 570 


575 


Pro 


Ser 


Ala 


Ser 


Pro Ser Ser Gin Ala Ala 


Val Ala Ala Ala Ser Ser 








580 


585 


590 


Ser 


Ser 


Ala 


Ser 


Ser Ser Ser Ala Ser Ser 


Ser Ser Ala Ser Ser Ser 






595 




600 


605 


Ser 


Ala 


Ser 


Ser 


Ser Ser Ala Ser Ser Ser 


Ser Ala Ser Ser Ser Ser 




610 






615 


620 


Ala 


Ser 


Ser 


Ser 


Ala Gly Gly Ala Gly Gly 


Ser Val Ala Ser Ala Ser 


625 








630 


635 640 


Gly 


Ala 


Gly 


Glu 


Arg Arg Glu Thr Ser Leu 


Gly Pro Arg Ala Ala Ala 










645 650 


655 


Pro 


Arg 


Gly 


Pro 


Arg Lys Cys Ala Arg Lys 


Thr Arg His Ala Glu Gly 








660 


665 


670 


Gly 


Pro 


Glu 


Pro 


Gly Ala Arg Asp Pro Ala 


Pro Gly Leu Thr Arg Tyr 
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675 




680 


Leu 


Pro 


He 


Ala Gly Val 


Ser Ser 




690 






695 


Asn 


Lvs 


Thr 


Val Thr Gly 


Asp Cys 


705 






710 




Glv 


His 


He 


Glv Ala Tvr 


Val Val 








725 




Ala 


Asp 


Leu 


T,«sn Attt Al a 
ucu niy fix a. 


Al a Ala 








740 




Leu 


Pro 


Glu 


His Ala Arg 


Asn Cys 






755 




760 


Pro 


Pro 


Ala 


Ser Glu Trp 


Asn Ser 




770 






775 


Met 


Leu 


Phe 


Asp Gin Gly 


Thr Leu 


785 






790 




Leu 


Arg 


Ser 


Arg His Pro 


Trp Ser 








805 




Ala 


Gly 


Asp 


Ala Pro Ala 


Gly His 








820 





685 



Val 


Val Ala Leu 


Ala Pro Tyr Val 




700 




Leu 


Pro Val Leu 


Aso Met Glu Thr 




715 




Leu 


Val Asd Gin 


Thr Glv Asn Val 

J. iii. noil » Qj. 




7^n 


/ jj 


Pro 


Ala Trp Ser 


Arg Arg Thr Leu 


745 




750 


Val 


Arg Pro Pro 


Asp Tyr Pro Thr 






765 


Leu 


Trp Met Thr 


Pro Val Gly Asn 




780 




Val 


Gly Ala Leu 


Asp Phe His Gly 




795 


800 


Arg 


Glu Gin Gly 


Ala Pro Ala Pro 




810 


815 


Gly 


Glu 




825 







<210> 48 
<211> 3350 
<212> DNA 
<213> HSV-2 



<220> 

<221> misc_feature 

<222> 1027, 1034, 1054, 1055, 1056, 1057, 1058, 1059, 1060, 1061, 
1062, 1063, 1064, 1065, 1066, 1067, 1068, 1069, 1070, 1071, 
1072, 1073, 1074, 1075, 1076, 1077, 1078, 1079, 1080, 1081, 
1082, 1083, 1084, 1085, 1086, 1087, 1088, 1089, 1090 
<223> n = A, T,C or G 

<221> misc_feature 

<222> 1091, 1092, 1093, 1094, 1095, 1096, 1097, 1098, 1099, 1100/ 

1101, 1102, 1103, 1104, 1105, 1106, 1107, 1108, 1109, 1110, 

1111, 1112, 1113, 1114, 1115, 1116, 1117, 1118, 1119, 1120, 

1121, 1122, 1123, 1124, 1125, 1126, 1127, 1128, 1129 
<223> n = A,T,C or G 



<221> misc_feature 

<222> 1130, 1131, 1132, 1133, 1134, 1135, 1136, 1137, 1138, 1139, 
1140, 1141, 1142, 1143, 1144, 1145, 1146, 1147, 1148, 1149, 
1150, 1151, 1152, 1327, 1364, 1390, 1392 
<223> n = A,T,C or G 



<400> 48 

ccgtcggtga cctgcaggag ctcgtttatt 
agcccctggg gagacaggtc cacggagtcc 
aggctgtagt ccccccaggc ccccaggtca 
gccgggctgg ggggcgcgtc ctcggtcgcg 
cgctgatcgg ccgagctggc gaagggcgcc 
ttccacgtgt cgtggagttc ctgaacgaac 
cgcgcggcct gatagccggc cgagaggcgc 
cagaacccgg ggacctggtc ccccgacatc 
acgatggcca ggaacccgtg gatttcccgc 
gagacctggg ccgccagctc gtcgcatacc 
gaccgcagcc ccgcgaggga cgcgaccagc 



aatagccagt ccatgctcag cgtagcggcc 60 
ggaaccaccg tcggctgacc caggggcccc 120 
tgacggttcg tgagcacgac gaggtctgcg 180 
tgggccatca cctcctgaat ggctgcggtg 240 
acgaccagcg cgcgctccgt ctgcaggccc 300 
tcggccaccc gctcggggcc cgtggccgcg 360 
cgccagcgcg ccaggaactg actcatgtaa 420 
aactttgacg ccctggcgtg gatgcccgac 480 
cgcacgacgg ccagcacgtt accctcgtgc 540 
ccgaggtgcg ccgtcgtctc ggtgacgacg 600 
gcgcgcttgg cgtcgtgata catgccgcag 660 
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tactggctca ccgcgtcgcc catggcctcg gggcgccagg gccccaggcg ctcgtgggcg 720 
tctgcgacca cggcgtacag gcggtgcccg tcgctctcga accggcactc aaagaaggcg 780 
gcgagcgtgc gcatgtgaag ccgcagcagc acgatcgcgt cctccagctg gcggaccagg 840 
gggtcggcgc gctcggcgag ctcctgcagc accccccggg ccgccagggc gtacatgctg 900 
atcagcagca ggctgctgcc cacctcggga ggctgggggg gaggcagctg gaccgcgggc 960 
cgcagctgct cgacggcccc cctggcgatc acgtacagct cgcgcagcag ctgctcgatg 1020 
ttgtcghgcc atcntgcatc gtgggcccga cgcnnnnnnn nnnnnnnnnn nnnnnnnnnn 1080 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 1140 
nnnnnnnnnn nnaggccagc acccgcaggg caaactcgat ggggcggggc aggtaggcag 1200 
cgttgcacgt ggccctcagc gcgtccccga ccaccagggc cagcacgtaa gggacgaacc 1260 
ccgggtcggc gaggacgttg gggtggatgc cctccagggc cgggaagcgg atcttggtgg 1320 
ccgcggncag gtgaaccgag ggggcgtggc taggcggccc gacngggagc atcgcggaca 1380 
gcggcgtggn cngggtggtg ggggtcaggt cccagtgggt ctggccgtac acgtcgagcc 1440 
agatgagcgc cgtctcgcgc aggaggctgg gctggccggc gctgaagcgg cgctcggccg 1500 
tctcaaactc ccccacgagc gtgcgccgca ggctcgccag gtgttccgtc ggcacggccg 1560 
ggcccatgat gcgcgccagc gtctggctga ggacgccgcc cgacaggccg accgcctcac 1620 
agagccgccc gtgcgtgtgc tcgctggcgc cctggatccg ccggaacgtt ttcacgtagc 1680 
cggcgtagtg cccgtactcc cgcgcgagcc cgaacacgtt cgcccccgca agggcaatgc 1740 
acccaaagag ctgctggatc tcgctgagcc cgtggccggg gggcgtccgc gcgggcaccc 1800 
ccgccaccaa aaacccctcc agggccgata tgtactgggt gcagtgcgcg ggcgtgaacc 1860 
ccgcgtcggt aagcgtgttg atcaccacgg agggcgagtt gctgttctgg accaaagccc 1920 
acgtctgctg cagcagcgcg aggagccgtt gctgggcccc ggcggagggc ggctccccta 1980 
gctgcagcag gccggtgacg gccggacgga agatggccag cgccgacgca ctcagaaacg 2040 
gcacgtcggg gtcgaagacg gccgcgtccg tccgcacgcg cgccatcagc gtccccgggg 2100 
gcgcgcacgc cgaccgcggg ctgacgcggc ttagggcggt cgacacgcgc acctcctcgc 2160 
gactgcgaac cattttggtg gcctcgaggg gcgggatcat gatagccggg tcgatctccc 2220 
gcaccgtgtg ctgaaactgg gccagcagcg gcggcgggac caccgcgccc cgatcggggg 2280 
tcgtcaggta ctcgtccacc agcgccagcg taaacagggc ccgcgtgagg ggggtcaggg 234 0 
cggcgtcgtc gatgcgctgt aggtgcgccg agaacagcgt cacccaattg ctgaccaggg 2400 
ccaagaaccg gagaccctct tgcacgatcg gggacgggaa gagcaggctg tacgccgggg 24 60 
tggtcaggtt ggcgccgggt tgccccaggg gaaccgggga catcttaagc gacatctccc 2520 
cgagggcctc cagggaggtc cgcgggttca tggccaggca gctctgggtg acggtccgcc 2580' 
agcggtcgat ccactccacg gcacactggc ggacgcgcac cggccccagg gccgccgtgg 2640 
tgcgcagccc ggcggcctcc agcgcgtggg tcgtgtcgga gccggtgatc gccaggaccg 2700 
tgtccttgat gacgtccatc tcccggaagg ccgcctcggg ggtctcgggg agcgccaccg 2760 
ccatgcggtg caccagcagc ccggggaggt tctcggccaa gagcgccgtc tccggaagcc 2820 
cgtgggcccg gtgcaaggcg cacagttgct ccaggagcgg gtgccagcac gcccgcgcct 2880 
ccgccgggcc gaccgccgcg cccgacaaca gaaacgccgc cgtggcggcg cgcagtttgg 2940 
ccgcggacag aaacgccggc tcgtccgcgc tgcccgccgg ctcgctcgag ggggagggcg 3000 
gccggcggag gttggtcagg ctccccaaca ggacctgcaa cggtccgttt gggggtggag 3060 
cggacggggg ggtcatgccg gcgggcgccg ggacctggag cgcgctgtcc gacatggcga 3120 
ccggcgtgcg cgctcggcga cgcggcgcgg agaccgcggg cccaaacggg aatgactgcc 3180 
gccgccctat acggaggggc taagtatcgc ccggggaccc ttcgaaaccc cgggcgtgtc 3240 
gcaagtacgc cgcgaaggcg cggcgtgtta tacggcgcgt tatgtcccgg cattccgttc 3300 
gtgggttcgg gcccgggtgc tgtcgggtgg gagtgtgtgt gtgtgggggg • 3350 

<210> 49 
<211> 3345 
<212> DNA 
<213> HSV-2 

<400> 49 

atgtcggaca gcgcgctcca ggtcccggcg cccgccggca tgaccccccc gtccgctcca 60 

cccccaaacg gaccgttgca ggtcctgttg gggagcctga ccaacctccg ccggccgccc 120 

tccccctcga gcgagccggc gggcagcgcg gacgagccgg cgtttctgtc cgcggccaaa 180 

ctgcacgccg ccacggcggc gtttctgttg tcgggcgcgg cggtcggccc ggcggaggcg 240 

cgggcgtgct ggcacccgct cctggagcaa ctgtgcgcct tgcaccgggc ccacgggctt 300 

ccggagacgg cgctcttggc cgagaacctc cccgggctgc tggtgcaccg catggcggtg 360 
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gcgctccccg agacccccga ggcggccttc cgggagatgg acgtcatcaa ggacacggtc 420 
ctggcgatca ccggctccga cacgacccac gcgctggagg ccgccgggct gcgcaccacg 480 
gcggccctgg ggccggtgcg cgtccgccag tgtgccgtgg agtggatcga ccgctggcgg 540 
accgtcaccc agagctgcct ggccatgaac ccgcggacct ccctggaggc cctcggggag 600 
atgtcgctta agatgtcccc ggttcccctg gggcaacccg gcgccaacct gaccaccccg 660 
gcgtacagcc tgctcttccc gtccccgatc gtgcaagagg gtctccggtt cttggccctg 720 
gtcagcaatt gggtgacgct gttctcggcg cacctacagc gcatcgacga cgccgccctg 780 
acccccctca cgcgggccct gtttacgctg gcgctggtgg acgactacct gacgaccccc 840 
gatcggggcg cggtggtccc gccgccgctg ctggcccagt ttcagcacac ggtgcgggag 900 
atcgacccgg ctatcatgat cccgcccctc gaggccacca aaatggttcg cagtcgcgag 960 
gaggtgcgcg tgtcgaccgc cctaagccgc gtcagcccgc ggtcggcgtg cgcgcccccg 1020 
gggacgctga tggcgcgcgt gcggacggac gcggccgtct tcgaccccga cgtgccgttt 1080 
ctgagtgcgt cggcgctggc catcttccgt ccggccgtca ccggcctgct gcagctaggg 1140 
gagccgccct ccgccggggc ccagcaacgg ctcctcgcgc tgctgcagca gacgtgggct 1200 
ttggtccaga acagcaactc gccctccgtg gtgatcaaca cgcttaccga cgcggggttc 1260 
acgcccgcgc actgcaccca gtacatatcg gccctggagg ggtttttggt ggcgggggtg 1320 
cccgcgcgga cgccccccgg ccacgggctc agcgagatcc agcagctctt tgggtgcatt 1380 
gcccttgcgg gggcgaacgt gttcgggctc gcgcgggagt acgggcacta cgccggctac 1440 
gtgaaaacgt tccggcggat ccagggcgcc agcgagcaca cgcacgggcg gctctgtgag 1500 
gcggtcggcc tgtcgggcgg cgtcctcagc cagacgctgg cgcgcatcat gggcccggcc 1560 
gtgccgacgg aacacctggc gagcctgcgg cgcacgctcg tgggggagtt tgagacggcc 1620 
gagcgccgct tcagcgccgg ccagcccagc ctcctgcgcg agacggcgct catctggctc 1680 
gacgtgtacg gccagaccca ctgggacctg acccccacca ccccggccac gccgctgtcc 1740 
gcgctgctcc ccgtcgggcc gcctagccac gccccctcgg ttcacctggc cgcggccacc 1800 
aagatccgct tcccggccct ggagggcatc caccccaacg tcctcgccga cccggggttc 18 60 
gtcccttacg tgctggccct ggtggtcggg gacgcgctga gggccacgtg caacgctgcc 1920 
tacctgcccc gccccatcga gtttgccctg cgggtgctgg cctgggcgcg cgacttcggc 1980 
ctgggctatc tccccaccgt cgaggggcac cgcacaaaat tgggcgcgct gatcaccctc 2040 
ctcgaaccgg ccacccgggc cggcgtcggg cccacgatgc agatggccga caacatcgag 2100 
cagctgctgc gcgagctgta cgtgatcgcc aggggggccg tcgagcagct gcggcccgcg 2160 
gtccagctgc ctccccccca gcctcccgag gtgggcagca gcctgctgct gatcagcatg 2220 
tacgccctgg cggcccgggg ggtgctgcag gagctcgccg agcgcgccga ccccctggtc 2280 
cgccagctgg aggacgcgat cgtgctgctg cggctgcaca tgcgcacgct cgccgccttc 2340 
tttgagtgcc ggttcgagag cgacgggcac cgcctgtacg ccgtggtcgc agacgcccac 2400 
gagcgcctgg ggccctggcg ccccgaggcc atgggcgacg cggtgagcca gtactgcggc 24 6.0 
atgtatcacg acgccaagcg cgcgctggtc gcgtccctcg cggggctgcg gtccgtcgtc 2520 
accgagacga cggcgcacct cggggtatgc gacgagctgg cggcccaggt ctcgcacgag 2580 
ggtaacgtgc tggccgtcgt gcggcgggaa atccacgggt tcctggccat cgtgtcgggc 2 640 
atccacgcca gggcgtcaaa gttgatgtcg ggggaccagg tccccgggtt ctgttacatg 2700 
agtcagttcc tggcgcgctg gcggcgcctc tcggccggct atcaggccgc acgcgcggcc 2760 
acgggccccg agcgggtggc cgagttcgtt caggaactcc acgacacgtg gaagggcctg 2820 
cagacggagc gcgcgctggt cgtggcgcgc ttcgccagct cggccgatca gcgcaccgca 2880 
gccattcagg aggtgatggc ccacgcgacc gaggacgcgc cccccagccc ggccgcagac 2940 
ctcgtcgtgc tcacgaaccg tcatgacctg ggggcctggg gggactacag cctggggccc 3000 
ctgggtcagc cgacggtggt tccggactcc gtggacctgt ctccccaggg gctggccgct 3060 
acgctgagca tggactggct attaataaac gagctcctgc aggtcaccga cggcgtgttt 3120 
cgcgcctcgg cgtttcggcc ttccgccggc ccgggggccc ccggggacct ggaggcccaa 3180 
gatgccggcg gtagcacccc cgaacccacg acacccggcc cacaggacac gcaggcccgc 3240 
gcgccgtcga cgcgcccggc gggccgcgag acggtccctt ggcccaacac ccccgtggag 3300 
gacgacgaga tgacgccgca ggagacacca ccggtacacc cgtag 3345 

<210> 50 
<211> 993 
<212> PRT 
<213> HSV-2 

<400> 50 

Glu Pro Ala Gly Ser Ala Asp Glu Pro Ala Phe Leu Ser Ala Ala Lys 



WO 02/02131 



PCT/US01/20981 



5 



Leu His 


Ala 


Ala 


Thr 


Ala 


Ala 


Phe 






20 










Pro Ala 


Glu 


Ala 


Arg 


Ala 


Cys 


Trp 




35 










40 


Ala Leu 


His 


Arcr 


Ala 


His 


Gly 


Leu 


50 










55 




Asn Leu 


Pro 


Glv 

J. Jf 


Leu 


Leu 


Val 


His 


65 








70 






Thr Pro 


Glu 


Ala 


Ala 


Phe 


Arg 


Glu 








85 








Leu Ala 


He 


Thr 


Glv 


Ser 


Asp 


Thr 






100 










Leu Arg 


Thr 


Thr 


Ala 


Ala 


Leu 


Gly 




115 










120 


Val Glu 


Trp 


He 


Asp 


Arg 


Trp 


Arg 


130 










135 




Met Asn 


Pro 


Ara 


Thr 


Ser 


Leu 


Glu 


145 








150 






Met Ser 


Pro 


Val 


Pro 


Leu 


Gly 


Gin 








165 








Ala Tyr 


Ser 


Leu 


Leu 


Phe 


Pro 


Ser 






180 










Phe Leu 


Ala 


Leu 


Val 


Ser 


Asn 


Trp 




195 










200 


Gin Arg 


He 


Asn 


Asp 


Ala 


Ala 


Leu 


210 










215 




Thr Leu 


Ala 


Leu 


Val 


Aso 


Asp 


Tvr 


225 








230 






Val Val 


Pro 


Pro 


Pro 


Leu 


Leu 


Ala 








245 








He Asp 


Pro 


Ala 


He 


Met 


He 


Pro 






260 










Arg Ser 


Arcr 


Glu 


Glu 


Val 


Ara 


Val 




275 










280 


Pro Arg 


Ser 


Ala 


Cys 


Ala 


Pro 


Pro 


290 










295 




Thr Asp 


Ala 


Ala 


Val 


Phe 


Asp 


Pro 


305 








310 






Ala Leu 


Ala 


He 


Phe 


Arg 


Pro 


Ala 








325 








Glu Pro 


Pro 


Ser 


Ala 


Gly 


Ala 


Gin 






340 










Gin Thr 


Trp 


Ala 


Leu 


Val 


Gin 


Asn 




355 










360 


Asn Thr 


Leu 


Thr 


Asp 


Ala 


Gly 


Phe 


370 










375 




He Ser 


Ala 


Leu 


Glu 


Gly 


Phe 


Leu 


385 








390 






Pro Pro 


Gly 


His 


Gly 


Leu 


Ser 


Glu 








405 








Ala Leu 


Ala 


Gly 


Ala 


Asn 


Val 


Phe 






420 










Tyr Ala 


Gly 


Tyr 


Val 


Lys 


Thr 


Phe 




435 










440 


His Thr 


His 


Gly 


Arg 


Leu 


Cys 


Glu 


450 










455 




Leu Ser 


Gin 


Thr 


Leu 


Ala 


Arg 


He 





32 






10 15 




Leu 


Leu Ser Gly Ala Ala Val 


Gly 


25 


30 


His 


Pro Leu Leu Glu Gin Leu 


Cys 




45 


Pro 


Glu Thr Ala Leu Leu Ala 


Glu 




60 




Arg 


Met Ala Val Ala Leu Pro 


Glu 




75 


80 


Met 


Asp Val He Lys Asp Thr 


Val 




90 95 




Thr 


His Ala Leu Glu Ala Ala 


Gly 


105 


110 


Pro 


Val Arg Val Arg Gin Cys 


Ala 




125 




Thr 


Val Thr Gin Ser Cys Leu 


Ala 




140 




Ala 


Leu Gly Glu Met Ser Leu 


Lys 




155 


160 


Pro 


Gly Ala Asn Leu Thr Thr 


Pro 




170 175 




Pro 


He Val Gin Glu Gly Leu 


Arg 


185 


190 




Val 


Thr Leu Phe Ser Ala His 


Leu 




205 




Thr 


Pro Leu Thr Arg Ala Leu 


Phe 




220 




Leu 


Thr Thr Pro Asp Arg Gly 


Ala 




235 


240 


Gin 


Phe Gin His Thr Val Arg 


Glu 




250 255 




Pro 


Leu Glu Ala Thr Lys Met 


Val 


265 


270 




Ser 


Thr Ala Leu Ser Arg Val 


Ser 




285 




Gly 


Thr Leu Met Ala Arg Val- 


Arg 




300 




Asp 


Val Pro Phe Leu Ser Ala 


Ser 




315 


320 


Val 


Thr Gly Leu Leu Gin Leu 


Gly 




330 335 


Gin 


Arg Leu Leu Ala Leu Leu 


Gin 


345 


350 




Ser 


Asn Ser Pro Ser Val Val 


He 




365 




Thr 


Pro Ala His Cys Thr Gin 


Tyr 




380 




Val 


Ala Gly Val Pro Ala Arg 


Thr 




395 


400 


He 


Gin Gin Leu Phe Gly Cys 


He 




410 415 




Gly 


Leu Ala Arg Glu Tyr Gly 


His 


425 


430 




Arg 


Arg He Gin Gly Ala Ser 


Glu 




445 




Ala 


Val Gly Leu Ser Gly Gly 


Val 




460 




Met 


Gly Pro Ala Val Pro Thr 


Glu 
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470 






475 


480 




XJC IX 


Al a Sor 
Axa uci 


T.pn Arrr 

XJC Li nX y 


Arg Thr 


Leu 


Val Glv Glu 

v dJ. oxy ox n. 


Phe Glu Thr Ala 








4R5 






490 


4 95 


ox lx 


Arg 


Arrr Pho 
nX y /TilC 


Oft -v- 7\ T a 
OCX XXX d 


rii v ni n 
vjxy ox i i 


Pro 


Car T.on T.on 

OCl XJCLX llCLX 


Ar*<ir Gl ii Thr* Ala 






son 






505 




510 


xjBU 


lie 


lip XJCU 


7\ C n Val 
nop Vdl 


Tttt* Gl \7 

lyx oxy 


fill n 


X 111 XIXO -LIjV 


7\ qn T.on Thr* Prn 
nojj xjclx xi xx irxLj 






515 




520 






525 


X Hi 


Th r 
1 xix 


Pr r\ Ala 
rxu nXa 


T>Vi r Pv»ft 
1X11 rlU 


T.on Qor 
XJCLX OCX 


Ala 


T.on T.on Prn 
xjclx xjclx itxlj 


Val Glv Prn Pm 
v dx LJxy xrxLx rxu 




5^0 
«J J vx 






5*35 




540 




OtrX 


Hi c 
fllo 


nia rro 


Qor Va 1 
OCX Vdl 


U 4 c T ft 1 1 

nis lieu 


Al a 
nld 


Ala Ala r PV»T* 
nXd Aid 1X11 


T.tro Tlo Byrr Pho 

xjy o xxc nx y irxie 








550 






555 


560 


Pro 


rilci 


lie U. ulU 


rn rift 
oiy lie 


Hi ca Prn 
nis xriLj 


Asn 


Va 1 T.on Al a 
vdl XjCLX nld 


Bqn Pm Gl \r Pho 
no*j rxu ox y irxic 








5G5 






S70 


575 


Va 1 


Pro 


Twr Va 1 

iyi vctx 


T.an 21 1 a 
He IX n.ld 


T.on Va 1 
xjclx vdl 


Val 
v dx 


Civ A QTi Ala 
uxy nay nia 


T.on Arrr Ala Thr 
xjclx nxLj nxd XXll 






5R0 






5R5 




590 

<J ZJ \J 


fi* ".7Q 


noil 


Ala Al a 


Tt/r T.on 
x y x. xjclx 


Prn Arrr 
ci vj nx y 


tl LX 


Tlo fill n Pho 

IXC u± Li IT liC 


Ala Tion Arrr Val 
nx d xjc lx nx y v d x 






595 




600 






605 


T.on 


Al a 
raid 


Trn Ala 
x i \J nxd 


Arrr A*?r** 
fii y iiop 


Phe Gly 


Leu 


(11 \r Tvr Tton 
v?xy iyi XJCLX 


Pro Thr Val Gin 

rxy x iix v dx u± u 




OIU 






615 




620 




Gly 


Hi q 
nxs 


Arg Thr 


xjy o xjclx 


Gl v Ala 

oxy n± ci 


Leu 


Tlo Thr* T.on 


TiOii Gin Pro Ala 


625 






630 






635 


640 


Thr 

X A AX. 


A r IT 

ni y 


Ala Gl v 


Val v 
v dx oxy 


Pro Thr 


Met 


Gin Met Al a 
vuxii net nxd 


Acjri Ran Tlo Gin 
nop noil xxc vjxla 








645 






650 


655 


Gin 


ucu 


T.on Arrr 
xjc ix nx y 


(21 n TiOii 
oxu UCU 


Tyr Val 


He 


Al a Arrr Gl v 
nx d nx y lztx y 


Ala Val Gin Gin 
nxa vdx ox lx oxii 






660 






665 




670 


xjc lx 


ni y 


Pro Al a 
nu nXd 


Val Kin 

V dl L7_L.11 


T.on Prn 

XJCLX tlU 


t 1U 


Prn (Z'\ n Prn 

XTXLJ V7X11 XT1LJ 


Pro Gin Val Glv 
t x u ox lx vdx ox y 






675 




680 






685 




Ocx 


T.on T.on 
lie LI Xjc U. 


T.on T 1 o 
Xjc IX lie 


Ocl IMC L. 


T vr 

iyr 


Al a T.on Al a 
nld Xjclx nld 


Ala Arrr fZl \i Val 
nXd niy OXy Vdl 




Ol7V 










700 
f \J\J 




L6U. 


bin 


Pill T Q1 1 

ol lx IicU 


Ala Gl n 
nld olU 


Arg Ala 


Asp 


Pm T.on Va 1 

xrxLx lit: lx vdx 


nxy bin lieU olU 


705 






710 
/ xu 






71 5 

llj 


790 


Asp 


Hid 


Tie Va 1 
11c Val 


T on T.on 
XjClx Xjclx 


7\ r rr T.on 
m y xjcu 


Hi q 
nib 


f\Aft+- 2\rnr TVit* 
l v Xc L ni y 1 ill 


T.oii A 1 a 11 a PV>o 
IicU nld nld tHc 








7?5 






7^0 

/ JU 


7^R 


jrnc 


LjXU 


Lys xnxy 


pha rzi ii 

XT 1 1C OX LX 


Ocl nbp 


Gl \7 


nis niy xjcu 


rp. 7r 1\1 =a T7=»l X7a 1 

iyx nia vai vai 






740 






745 




750 


fixa 


Asp 


Ala H-l q 
ni ci n 1 S 


r* "J it 7V -r~ rr 
OXLX nx y 


XJCLX Lzrxy 


xrx(j 


•y rr> S r /t Pm 
1 i p niy rxu 


fill n Al a Mot* fill tr 
ulu nld rlcC bxy 






755 




760 






765 


ni3)J 


AT a 
nld 


Val °.P>r 
vctx Ocl 


Gin Tyr 


Cys Gly 


Moh 

L 1C L, 


1 y X rxxo noiy 


Al a T.tre ZirrT A 1 £» 

nxd xy t> ni y nxd 




770 






775 




780 




T.on 


Va 1 
Vdl 


Ala Qor 
nXa ucl 


T.on Ala 
lie Li nxd 


fill \7 T.on 
Lzrxy XJCLX 


- 

Arg 


Car Val Val 
ocx vdx vdx 


Thr fill n TV^r TVtr 

nil uiu inr ini 


785 






790 






795 


ftOO 


Al a 

nXd 


Hi ct 
nia 


T.on fill \7 
xjclx uxy 


Val Cys 


Ban fill n 
noLy lit j. lx 


T.on 

XJCLX 


Ala Ala Gin 
nxd nx a. uxii 


Val Qfty. u-5 o ni„ 
vdl Ocx xllS olU 








80S 






RIO 


pi c 
o lO 


oxy 


Asn 


Val Lon 

VOX XJC IX 


Al a Va 1 
nXd v dx 


Val Am 
v dx nx y 


nx. y 


Gl n Tlo Hi q 
L3XLX xxe nxs 


fZ~\ \r Pho T on JM m 

oiy rne xjcu nia 












O vJ 




R^O 


Tl o 
lie 


Va 1 

vai 


Car* Gl T7 
Ocl oxy 


Tl o Ri q 

XXC ill CJ 


Ala Irn 
nld nl y 


Al a 
nld 


Q ft r T.x/Q T.on 
Ocl ixyo lieu 


Mflf Oft "K» PI if 7\ c?Tft 

•■let oer vjiy Asp 






835 




840 






845 


Gin 


Val 


Pro Gly 


Phe Cys 


Tyr Met 


Ser 


Gin Phe Leu 


Ala Arg Trp Arg 




850 






855 




860 




Arg 


Leu 


Ser Ala 


Gly Tyr 


Gin Ala 


Ala 


Arg Ala Ala 


Thr Gly Pro Glu 


865 






870 






875 


880 


Arg 


Val 


Ala Glu 


Phe Val 


Gin Glu 


Leu 


His Asp Thr 


Trp Lys Gly Leu 








885 






890 


895 


Gin 


Thr 


Glu Arg 


Ala Leu 


Val Val 


Ala 


Arg Phe Ala 


Ser Ser Ala Asp 






900 






905 




910 


Gin 


Arg 


Thr Ala 


Ala He 


Gin Glu 


Val 


Met Ala His 


Ala Thr Glu Asp 






915 




920 






925 


Ala 


Pro 


Pro Ser 


Pro Ala 


Ala Asp 


Leu 


Val Val Leu 


Thr Asn Arg His 
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930 935 940 

Asp Leu Gly Ala Trp Gly Asp Tyr Ser Leu Gly Pro Leu Gly Gin Pro 
945 950 955 960 

Thr Val Val Pro Asp Ser Val Asp Leu Ser Pro Gin Gly Leu Ala Ala 

965 970 975 

Thr Leu Ser Met Asp Trp Leu Leu lie Asn Glu Leu Leu Gin Val Thr 
980 985 990 

Asp 

<210> 51 
<211> 1113 
<212> PRT 
<213> HSV-2 

<400> 51 

Met Ser Asp Ser Ala Leu Gin Val Pro Ala Pro Ala Gly Met Thr Pro 

5 10 15 

Pro Ser Ala Pro Pro Pro Asn Gly Pro Leu Gin Val Leu Leu Gly Ser 

20 25 - 30 

Leu Thr Asn Leu Arg Arg Pro Pro Ser Pro Ser Ser Glu Pro Ala Gly 

35 40 45 

Ser Ala Asp Glu Pro Ala Phe Leu Ser Ala Ala Lys Leu His Ala Ala 

50 55 60 

Thr Ala Ala Phe Leu Leu Ser Gly Ala Ala Val Gly Pro Ala Glu Ala 
65 70 75 " 80 

Arg Ala Cys Trp His Pro Leu Leu Glu Gin Leu Cys Ala Leu His Arg 

85 90 95 

Ala His Gly Leu Pro Glu Thr Ala Leu Leu Ala Glu Asn Leu Pro Gly 

100 105 110 

Leu Leu Val His Arg Met Ala Val Ala Leu Pro Glu Thr Pro Glu Ala 

115 120 125 

Ala Phe Arg Glu Met Asp Val He Lys Asp Thr Val Leu Ala He Thr 

130 135 140 

Gly Ser Asp Thr Thr His Ala Leu Glu Ala Ala Gly Leu Arg Thr Thr 
145 150 155 160 

Ala Ala Leu Gly Pro Val Arg Val Arg Gin Cys Ala Val Glu Trp He 

165 170 175 

Asp Arg Trp Arg Thr Val Thr Gin Ser Cys Leu Ala Met Asn Pro Arg 

180 185 190 

Thr Ser Leu Glu Ala Leu Gly Glu Met Ser Leu Lys Met Ser Pro Val 

195 200 ~* 205 

Pro Leu Gly Gin Pro Gly Ala Asn Leu Thr Thr Pro Ala Tyr Ser Leu 

210 215 220 

Leu Phe Pro Ser Pro He Val Gin Glu Gly Leu Arg Phe Leu Ala Leu 
225 230 235 " 240 

Val Ser Asn Trp Val Thr Leu Phe Ser Ala His Leu Gin Arg He Asp 

245 250 255 

Asp Ala Ala Leu Thr Pro Leu Thr Arg Ala Leu Phe Thr Leu Ala Leu 

260 265 270 

Val Asp Asp Tyr Leu Thr Thr Pro Asp Arg Gly Ala Val Val Pro Pro 

275 280 " 285 

Pro Leu Leu Ala Gin Phe Gin His Thr Val Arg Glu He Asp Pro Ala 

290 295 300 

He Met He Pro Pro Leu Glu Ala Thr Lys Met Val Arg Ser Arg Glu 
305 310 315 ' 320 

Glu Val Arg Val Ser Thr Ala Leu Ser Arg Val Ser Pro Arg Ser Ala 

325 330 335 

Cys Ala Pro Pro Gly Thr Leu Met Ala Arg Val Arg Thr Asp Ala Ala 
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340 


345 


350 






Val 


Phe 


Asp 


Pro 


Asp Val Pro Phe Leu Ser Ala 


Ser Ala Leu 


Ala 


He 






355 




360 


365 






Phe 


Ara 


Pro 


Ala 


Val Thr Gly Leu Leu Gin Leu 


Gly Glu Pro 


Pro 


Ser 




370 






375 


380 






Ala 


Glv 


Ala 


Gin 


Gin Ara Leu Leu AT a Lpu Leu 


Gin Gin Thr 

wJ.il VJJ.il 


Tm 


Ala 


385 








390 395 






400 


Leu 


Val 


Gin 


Asn 


Ser Asn Ser Pro Ser Val Val 


He Asn Thr 


Leu 


Thr 










405 410 




415 




Asp 


Ala 


Glv 


Phe 


Thr Pro Ala His Cvs Thr Gin 


Tvr He Ser 


Ala 


Leu 








420 


425 


430 






Glu 


Glv 


Phe 


Leu 


Val ' Ala Glv Val Pro Ala Ara 


Thr Pro Pro 


Glv 


His 






435 




440 


445 






Gly 


Leu 


Ser 


Glu 


lie Gin Gin Leu Phe Gly Cys 


He Ala Leu 


Ala 


Glv 




450 






455 


460 






Ala 


Asn 


Val 


Phe 


Gly Leu Ala Arg Glu Tyr Gly 


His Tvr Ala 

11 JU fc-J 1» Y JL» *1»1» 


Glv 


Tvr 


465 








470 • 475 






480 


Val 


Lvs 


Thr 


Phe 


Arg Arg He Gin Gly Ala Ser 


Glu His Thr 


His 


Glv 










485 490 




495 




Arq 


Leu 


Cvs 


Glu 


Ala Val Gly Leu Ser Gly Gly 


Val Leu Ser 


Gin 


Thr 








500 


505 


510 






Leu 


Ala 


Arg 


lie 


Met Gly Pro Ala Val Pro Thr 


Glu His Leu 

U»LU XiJ-*«J 


Ala 


Ser 






515 




520 


525 






Leu 


Arg 


Ara 


Thr 


Leu Val Gly Glu Phe Glu Thr 


Ala Glu Arg 


Arg 


Phe 




530 






535 


540 






Ser 


Ala 


Glv 


Gin 


Pro Ser Leu Leu Arg Glu Thr 


Ala Leu He 


Trn 


Leu 


545 








550 555 






560 


Asp 


Val 


Tvr 


Glv 


Gin Thr His Trp Asp Leu Thr 


Pro Thr Thr 


Pro 


Ala 










565 " 570 




575 




Thr 


Pro 


Leu 


Ser 


Ala Leu Leu Pro Val Gly Pro 


Pro Ser His 


Ala 


Pro 








580 


585 


590 






Ser 


Val 


His 


Leu 


Ala Ala Ala Thr Lvs lie Ara 


Phe Pro AT a 


Leu 


Glu 






595 




600 


605 






Glv 


lie 


His 


Pro 


Asn Val Tipu Al a Asn Pro Ol v 


Phe Val Pro 
n lie v ola. nu 




Val 




610 






615 


620 






Leu 


Ala 


Leu 


Val 


Val Glv Asn Ala Leu Ara Ala 


Thr Cvs Asn 

JL 1 1 JL V> Jjf O JCiO 1 1 


Ala 


Ala 


625 








630 635 




1 


640 


Tvr 


Leu 


Pro 


Arg 


Pro He Glu Phe Ala Leu Ara 


Va 1 Tien A*l ^ 




Ala 










645 650 




655 




Ara 


Asp 


Phe 


Glv 


Leu Gly Tyr Leu Pro Thr Val 


Glu Glv His 


Arg 


Thr 








660 


665 


670 






Lys 


Leu 


Glv 


Ala 


Leu He Thr Leu Leu Glu Pro 


Ala Thr Ara 

riAti i iiju «ii u 


Ala 


Glv 






675 




680 


685 






Val 


Glv 


Pro 


Thr 


Met Gin Met Ala Asp Asn He 


Glu Gin Leu 

V^J^JUIA VXJ>1 XJWU 


Leu 


Arg 




690 






695 


700 






Glu 


Leu 


Tvr 


Val 


He Ala Ara Glv Ala Val Glu 


Gin Leu Ara 

vjxii lieu nj.u 


Pro 


Ala 


705 








710 715 




720 


Val 


Gin 


Leu 


Pro 


Pro Pro Gin Pro Pro Glu Val 


Gly Ser Ser 


Leu 


Leu 










725 730 




735 




Leu 


lie 


Ser 


Met 


Tyr Ala Leu Ala Ala Arg Gly 


Val Leu Gin 


Glu 


Leu 








740 


745 


750 






Ala 


Glu 


Arg 


Ala 


Asp Pro Leu Val Arg Gin Leu 


Glu Asp Ala 


He 


Val 






755 




760 


765 






Leu 


Leu 


Arg 


Leu 


His Met Arg Thr Leu Ala Ala 


Phe Phe Glu 


Cys 


Arg 




770 






775 


780 






Phe 


Glu 


Ser 


Asp 


Gly His Arg Leu Tyr Ala Val 


Val Ala Asp 


Ala 


His 


785 








790 795 




800 


Glu 


Arg 


Leu 


Gly 


Pro Trp Arg Pro Glu Ala Met 


Gly Asp Ala 


Val 


Ser 
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805 










810 


815 


Gin 


Tvr 


Cys 


Glv 


Met 


Tvr 


His 


Asp 


Ala 


Lvs Am 

Jul V uy JTi 1 \J 


Ala T.pii Val Ala 9e»r 

nxa lieu v al nX ci JCi 








820 










825 






Leu 


Ala 


Glv 


Leu 


Arg 


Ser 


Val 


Val 


Thr 


Glu Thr 


Thr Ala HH«5 T.<»n Glv 
j-iiJL r\jL a riio lieu OAjf 






835 










840 






845 


Val 


Cys 


Asp 


Glu 


Leu 


Ala 


Ala 


Gin 


Val 


Ser His 

OCX il_L O 


Gin Gl \r Hen Va 1 T.on 
oi n ox _y nsiii vox lieu 




850 










855 








860 


Ala 


Val 


Val 


Arg 


Arg 


Glu 


He 


His 


Gly 


Php Ti^n 


Ala T1p Val <?pt- Glv 

ftlo JLXc val OCX \JJ.y 


865 










870 








875 




lie 


His 


Ala 


Arg 


Ala 


Ser 


Lys 


Leu 


Met 


Ser Gly 


A an Gin Val Prn Gl \r 










885 










890 


895 


Phe 


Cys 


Tvr 


Met 


Ser 


Gin 


Phe 


Leu 


Ala 


Arg Trp 


Ay*<t A T~rr Ti<an Qor Al a 
nx^j niy UCLL OCX Ala 








900 










905 




910 


Glv 


Tvr 

JT 


Gin 


Ala 


Ala 


Arg 


Ala 


Ala 


Thr 


Gly Pro 


Glu Arrr Val Ala Gin 






915 










920 






925 


Phe 


Val 


Gin 


Glu 


Leu 


His 


Asp 


Thr 


Trp 


Lys Gly 


Levi Gin Thr Gl n Arrr 

ilCU VjU.11 J. 11JL VJJ JL u. rlL y 




930 










935 








940 


Ala 


Leu 


Val 


Val 


Ala 


Arg 


Phe 


Ala 


Ser 


Ser Ala 


Asp Gin Arg Thr Ala 


945 










950 








955 


960 


Ala 


He 


Gin 


Glu 


Val 


Met 


Ala 


His 


Ala 


Thr Glu 


Asp Ala Pro Pro Ser 










965 










970 


975 


Pro 


Ala 


Ala 


Asp 


Leu 


Val 


Val 


Leu 


Thr 


Asn Arg 


His Asp Leu Gly Ala 








980 










985 




990 


Trp 


Gly 


Asp 


Tyr 


Ser 


Leu 


Gly 


Pro 


Leu 


Gly Gin 


Pro Thr Val Val Pro 






995 










1000 




1005 


Asp 


Ser 


Val 


Asp 


Leu 


Ser 


Pro 


Gin Gly 


Leu Ala 


Ala Thr Leu Ser Met 




1010 








1015 






1020 


Asp 


Trp Leu 


Leu 


He 


Asn 


Glu 


Leu 


Leu 


Gin Val 


Thr Asp Gly Val Phe 


1025 








1030 






1035 1040 


Arg Ala 


Ser 


Ala 


Phe 


Arg 


Pro 


Ser Ala 


Gly Pro 


Gly Ala Pro Gly Asp 










1045 








1050 


1055 


Leu 


Glu 


Ala 


Gin 


Asp Ala Gly Gly Ser 


Thr Pro 


Glu Pro Thr iThr Pro 








1060 








1065 


1070 



Gly Pro Gin Asp Thr Gin Ala Arg Ala Pro Ser Thr Pro Ala Gly Arg 
1075 1080 1085 



Glu Thr Val Pro Trp Pro Asn Thr Pro Val Glu Asp Asp Glu Met Thr 

1090 1095 1100 

Pro Gin Glu Thr Pro Pro Val His Pro 
1105 1110 



<210> 52 
<211> 3113 
<212> DNA 
<213> HSV-2 

<400> 52 

atgtcggaca gcgcgctcca ggtcccggcg cccgccggca tgaccccccc gtccgctcca 60 
cccccaaacg gaeegttgea ggtcctgttg gggagcctga ccaacctccg ccggccgccc 120 
tccccctcga gcgagccggc gggcagcgcg gaegagcegg cgtttctgtc cgcggccaaa 180 
ctgcgcgccg ccacggcggc gtttctgttg tegggegegg cggtcggccc ggcggaggcg 240 
eggqegtget ggcacccgct cctggagcaa ctgtgcgcct tgcaccgggc ccacgggctt 300 
ceggagaegg cgctcttggc cgagaacctc cccgggctgc tggtgcaccg catggcggtg 360 
gcgctccccg agacccccga ggcggccttc egggagatgg aegtcatcaa ggacacggtc 420 
ctggcgatca ccggctccga cacgacccac gcgctggagg ccgccgggct gcgcaccacg 480 
gcggccctgg ggccggtgcg cgtccgccag tgtgccgtgg agtggatcga ccgctggcgg 540 
accgtcaccc agagctgect ggecatgaac ccgcggacct ccctggaggc ecteggggag 600 
atgtcgctta agatgtcccc ggttcccctg gggcaacccg gcgccaacct gaccaccccg 660 
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gcgtacagcc tgctcttccc gtccccgatc 
gtcagcaatt gggtgacgct gttctcggcg 
acccccctca cgcgggccct gtttacgctg 
gatcggggcg cggtggtccc gccgccgctg 
atcgacccgg ctatcatgat cccgcccctc 
gaggtgcgcg tgtcgaccgc cctaagccgc 
gggacgctga tggcgcgcgt gcggacggac 
ctgagtgcgt cggcgctggc catcttccgt 
gagccgccct ccgccggggc ccagcaacgg 
ttggtccaga acagcaactc gccctccgtg 
acgcccgcgc actgcaccca gtacatatcg 
cccgcgcgga cgccccccgg ccacgggctc 
gcccttgcgg gggcgaacgt gttcgggctc 
gtgaaaacgt tccggcggat ccagggcgcc 
gcggtcggcc tgtcgggcgg cgtcctcagc 
gtgccgacgg aacacctggc gagcctgcgg 
gagcgccgct tcagcgccgg ccagcccagc 
gacgtgtacg gccagaccca ctgggacctg 
gcgctgctcc ccgtcgggcc gcctagccac 
aagatccgct tcccggccct ggagggcatc 
gtcccttacg tgctggccct ggtggtcggg 
tacctgcccc gccccatcga gtttgccctg 
ctgggctatc tccccaccgt cgaggggcac 
ctcgaaccgg ccacccgggc cggcgtcggg 
cagctgctgc gcgagctgta cgtgatcgcc 
gtccagctgc ctccccccca gcctcccgag 
tacgccctgg cggcccgggg ggtgctgcag 
cgccagctgg aggacgcgat cgtgctgctg 
tttgagtgcc ggttcgagag cgacgggcac 
gagcgcctgg ggccctggcg ccccgaggcc 
atgtatcacg acgccaagcg cgcgctggtc 
accgagacga cggcgcacct cggggtatgc 
ggtaacgtgc tggccgtcgt gcggcgggaa 
atccacgcca gggcgtcaaa gttgatgtcg 
agtcagttcc tggcgcgctg gcggcgcctc 
acgggccccg agcgggtggc cgagttcgtt 
cagacggagc gcgcgctggt cgtggcgccc 
gccattcagg aggtgatggc ccacgcgacc 
ctcgtcgtgc tcacgaaccg tcatgacctg 
ctgggtcagc cgacggtggt tccggactcc 
acgctgagca tggactggct attaataaac 

<210> 53 
<211> 761 
<212> DNA 
<213> HSV-2 



gtgcaagagg gtctccggtt cttggccctg 720 
cacctacagc gcatcgacga cgccgccctg 780 
gcgctggtgg acgagtacct gacgaccccc 840 
ctggcccagt ttcagcacac ggtgcgggag 900 
gaggccacca aaatggttcg cagtcgcgag 960 
gtcagcccgc ggtcggcgtg cgcgcccccg 1020 
gcggccgtct tcgaccccga cgtgccgttt 1080 
ccggccgtca ccggcctgct gcagctaggg 1140 
ctcctcgcgc tgctgcagca gacgtgggct 1200 
gtgatcaaca cgcttaccga cgcggggttc 1260 
gccctggagg ggtttttggt ggcgggggtg 1320 
agcgagatcc agcagctctt tgggtgcatt 1380 
gcgcgggagt acgggcacta cgccggctac 1440 
agcgagcaca cgcacgggcg gctctgtgag 1500 
cagacgctgg cgcgcatcat gggcccggcc 1560 
cgcacgctcg tgggggagtt tgagacggcc 1620 
ctcctgcgcg agacggcgct catctggctc 1680 
acccccacca ccccggccac gccgctgtcc 1740 
gccccctcgg ttcacctggc cgcggccacc 1800 
caccccaacg tcctcgccga cccggggttc 1860 
gacgcgctga gggccacgtg caacgctgcc 1920 
cgggtgctgg cctgggcgcg cgacttcggc 1980 
cgcacaaaat tgggcgcgct gatcaccctc 2040 
cccacgatgc agatggccga caacatcgag 2100 
aggggggccg tcgagcagct gcggcccgcg 2160 
gtgggcagca gcctgctgct gatcagcatg 2220 
gagctcgccg agcgcgccga ccccctggtc ' 2280 
cggcttcaca tgcgcacgct cgccgccttc 2340 
cgcctgtacg ccgtggtcgc agacgcccac 2400 
atgggcgacg cggtgagcca gtactgcggc 2460 
gcgtccctcg cggggctgcg gtccgtcgtc 2520 
gacgagctgg cggcccaggt ctcgcacgag 2580 
atccacgggt tcctggccat cgtgtcgggc 2640 
ggggaccagg tccccgggtt ctgttacatg 2700 
tcggccggct atcaggccgc gcgcgcggcc 2760 
caggaactcc acgacacgtg gaagggcctg 2820 
ttcgccagct cggccgatca gcgcaccgca 2880 
gaggacgcgc cccccagccc ggccgcagac 2940 
ggggcctggg gggactacag cctggggccc 3000 
gtggacctgt ctccccaggg gctggccgct 3060 
gagctcctgc aggtcaccga egg 3113 



<400> 53 

gcgcccgctc gcggctcagc gcgaggccgc 
gccagacgcg gaggecgaac eggaccagat 
cgccgccatg cccgcgggcg tegggcttgg 
cgcctggccg gccgaaagcc acgccccccg 
egagteggtg agegaggatg gggggcgcgt 
cgaaaacatc tgccttcgcc ggcaagaege 
ggactccccg tacatcgagg eggaaaatec 
ctcccctccg ggggccacac gcgccccgga 
ccgcccccgg accaacgcgc tggecaaega 



eggggtttae gacgcggtgc ggacctgggg 60 

ggaaaacacg tatctgetge ccgacgatga 120 

cgccaccccc gccgccgaca ccaccgccgc 180 

cgccccctcc gaggaegcag attccattta 240 

ctacgaggag atcccytggg ttegggtata 300 

eggeggggeg gccccgccgg gagacgcccc 360 

cctgtacgac tggggcgggt ctgccctctt 420 

cccgggacta agcctgtcgc ccatgcccgc 480 

cggcccgaca aacgtcgccg ccctcagcgc 540 
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cctgttgacg aagctcaaac gcggccgaca ccagagccat taaaaaaatg cgaccgccgg 600 
ccccaccgtc tcggtttccg gcccctttcc ccgtatgtct gttttcaata aaaagtaaca 660 
aacagagaaa aaaaaacagc gagttccgca tggtttgtcg tacgcaatta gctgtttatt 720 
gttttttttt tggggggggg aagagaaaaa gaaaaaagga g 761 

<210> 54 
<211> 1037 
<212> PRT 
<213> HSV-2 

<400> 54 

Met Ser Asp Ser Ala Leu Gin Val Pro Ala Pro Ala Gly Met Thr Pro 
5 10 15 

Pro Ser Ala Pro Pro Pro Asn Gly Pro Leu Gin Val Leu Leu Gly Ser 
20 25 30 

Leu Thr Asn Leu Arg Arg Pro Pro Ser Pro Ser Ser Glu Pro Ala Gly 
35 40 45 

Ser Ala Asp Glu Pro Ala Phe Leu Ser Ala Ala Lys Leu Arg Ala Ala 
50 " 55 60 

Thr Ala Ala Phe Leu Leu Ser Gly Ala Ala Val Gly Pro Ala Glu Ala 
65 70 75 80 

Arg Ala Cys Trp His Pro Leu Leu Glu Gin Leu Cys Ala Leu His Arg 
85 - 90 95 

Ala His Gly Leu Pro Glu Thr Ala Leu Leu Ala Glu Asn Leu Pro Gly 
100 105 110 

Leu Leu Val His Arg Met Ala Val Ala Leu Pro Glu Thr Pro Glu Ala 
115 120 125 

Ala Phe Arg Glu Met Asp Val lie Lys Asp Thr Val Leu Ala lie Thr 
130 135 140 

Gly Ser Asp Thr Thr His Ala Leu Glu Ala Ala Gly Leu Arg Thr Thr 
145 150 155 160 

Ala Ala Leu Gly Pro Val Arg Val Arg Gin Cys Ala Val Glu Trp lie 
165 170 175 

Asp Arg Trp Arg Thr Val Thr Gin Ser Cys Leu Ala Met Asn Pro Arg 
180 185 190 

Thr Ser Leu Glu Ala Leu Gly Glu Met Ser Leu Lys Met Ser Pro Val 
195 200 205 

Pro Leu Gly Gin Pro Gly Ala Asn Leu Thr Thr Pro Ala Tyr Ser Leu 
210 215 220 

Leu Phe Pro Ser Pro lie Val Gin Glu Gly Leu Arg Phe Leu Ala Leu 
225 230 235 240 

Val Ser Asn Trp Val Thr Leu Phe Ser Ala His Leu Gin Arg He Asp 
245 250 255 
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Asp Ala Ala Leu Thr Pro Leu Thr Arg Ala Leu Phe Thr Leu Ala Leu 
260 265 270 

Val Asp Glu Tyr Leu Thr Thr Pro Asp Arg Gly Ala Val Val Pro Pro 
275 280 285 

Pro Leu Leu Ala Gin Phe Gin His Thr Val Arg Glu lie Asp Pro Ala 
290 295 300 

lie Met lie Pro Pro Leu Glu Ala Thr Lys Met Val Arg Ser Arg Glu 
305 310 315 320 

Glu Val Arg Val Ser Thr Ala Leu Ser Arg Val Ser Pro Arg Ser Ala 
325 330 335 

Cys Ala Pro Pro Gly Thr Leu Met Ala Arg Val Arg Thr Asp Ala Ala 
340 345 350 

Val Phe Asp Pro Asp Val Pro Phe Leu Ser Ala Ser Ala Leu Ala lie 
355 360 365 

Phe Arg Pro Ala Val Thr Gly Leu Leu Gin Leu Gly Glu Pro Pro Ser 
370 375 380 

Ala Gly Ala Gin Gin Arg Leu Leu Ala Leu Leu Gin Gin Thr Trp Ala 
385 390 395 " 400 

Leu Val Gin Asn Ser Asn Ser Pro Ser Val Val lie Asn Thr Leu Thr 
405 410 415 

Asp Ala Gly Phe Thr Pro Ala His Cys Thr Gin Tyr He Ser Ala Leu 
420 425 430 

Glu Gly Phe Leu Val Ala Gly Val Pro Ala Arg Thr Pro Pro Gly His 
435 440 445 

Gly Leu Ser Glu He Gin Gin Leu Phe Gly Cys He Ala Leu Ala Gly 
450 455 460 

Ala Asn Val Phe Gly Leu Ala Arg Glu Tyr Gly His Tyr Ala Gly Tyr 
465 470 475 480 

Val Lys Thr Phe Arg Arg He Gin Gly Ala Ser Glu His Thr His Gly 
485 490 495 

Arg Leu Cys Glu Ala Val Gly Leu Ser Gly Gly Val Leu Ser Gin Thr 
500 505 510 

Leu Ala Arg He Met Gly Pro Ala Val Pro Thr Glu His Leu Ala Ser 
515 520 525 

Leu Arg Arg Thr Leu Val Gly Glu Phe Glu Thr Ala Glu Arg Arg Phe 
530 . 535 540 



Ser Ala Gly Gin Pro Ser Leu Leu Arg Glu Thr Ala Leu He Trp Leu 
545 550 555 560 
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Asp Val Tyr Gly Gin Thr His Trp Asp Leu Thr Pro Thr Thr Pro Ala 
565 570 575 

Thr Pro Leu Ser Ala Leu Leu Pro Val Gly Pro Pro Ser His Ala Pro 
580 585 590 

Ser Val His Leu Ala Ala Ala Thr Lys lie Arg Phe Pro Ala Leu Glu 
595 600 605 

Gly lie His Pro Asn Val Leu Ala Asp Pro Gly Phe Val Pro Tyr Val 
610 615 620 

Leu Ala Leu Val Val Gly Asp Ala Leu Arg Ala Thr Cys Asn Ala Ala 
625 630 635 ~ 640 

Tyr Leu Pro Arg Pro lie Glu Phe Ala Leu Arg Val Leu Ala Trp Ala 
645 650 655 

Arg Asp Phe Gly Leu Gly Tyr Leu Pro Thr Val Glu Gly His Arg Thr 
660 665 670 

Lys Leu Gly Ala Leu lie Thr Leu Leu Glu Pro Ala Thr Arg Ala Gly 
675 680 685 

Val Gly Pro Thr Met Gin Met Ala Asp Asn lie Glu Gin Leu Leu Arg 
690 695 .700 

Glu Leu Tyr Val lie Ala Arg Gly Ala Val Glu Gin Leu Arg Pro Ala 
705 710 715 720 

Val Gin Leu Pro Pro Pro Gin Pro Pro Glu Val Gly Ser Ser Leu Leu 
725 730 * 735 

Leu lie Ser Met Tyr Ala Leu Ala Ala Arg Gly Val Leu Gin Glu Leu 
740 745 750 

Ala Glu Arg Ala Asp Pro Leu Val Arg Gin Leu Glu Asp Ala He Val 
755 760 765 

Leu Leu Arg Leu His Met Arg Thr Leu Ala Ala Phe Phe Glu Cys Arg 
770 775 780 

Phe Glu Ser Asp Gly His Arg Leu Tyr Ala Val Val Ala Asp Ala His 
785 790 795 800 

Glu Arg Leu Gly Pro Trp Arg Pro Glu Ala Met Gly Asp Ala Val Ser 
805 810 815 

Gin Tyr Cys Gly Met Tyr His Asp Ala Lys Arg Ala Leu Val Ala Ser 
820 825 830 

Leu Ala Gly Leu Arg Ser Val Val Thr Glu Thr Thr Ala His Leu Gly 
835 840 845 

Val Cys Asp Glu Leu Ala Ala Gin Val Ser His Glu Gly Asn Val Leu 
850 855 860 



Ala Val Val Arg Arg Glu He His Gly Phe Leu Ala He Val Ser Gly 
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865 870 875 880 

lie His Ala Arg Ala Ser Lys Leu Met Ser Gly Asp Gin Val Pro Gly 

885 890 * 895 

Phe Cys Tyr Met Ser Gin Phe Leu Ala Arg Trp Arg Arg Leu Ser Ala 

900 905 " 910 

Gly Tyr Gin Ala Ala Arg Ala Ala Thr Gly Pro Glu Arg Val Ala Glu 

915 920 925 

Phe Val Gin Glu Leu His Asp Thr Trp Lys Gly Leu Gin Thr Glu Arg 

930 935 " 940 

Ala Leu Val Val Ala Pro Phe Ala Ser Ser Ala Asp Gin Arg Thr Ala 

945 950 955 960 

Ala He Gin Glu Val Met Ala His Ala Thr Glu Asp Ala Pro Pro Ser 

965 970 * 975 

Pro Ala Ala Asp Leu Val Val Leu Thr Asn Arg His Asp Leu Gly Ala 

980 985 ~ " 990 

Trp Gly Asp Tyr Ser Leu Gly Pro Leu Gly Gin Pro Thr Val Val Pro 

995 1000 1005 

Asp Ser Val Asp Leu Ser Pro Gin Gly Leu Ala Ala Thr Leu Ser Met 

1010 1015 1020 

Asp Trp Leu Leu He Asn Glu Leu Leu Gin Val Thr Asp 

,1025 1030 1035 



<210> 55 
<211> 193 
<212> PRT 
<213> HSV-2 

<400> 55 

Arg Pro Leu Ala Ala Gin Arg Glu Ala Ala Gly Val Tyr Asp Ala Val 
5 10 " 15 

Arg Thr. Trp Gly Pro Asp Ala Glu Ala Glu Pro Asp Gin Met Glu Asn 
20 25 30 

Thr Tyr Leu Leu Pro Asp Asp Asp Ala Ala Met Pro Ala Gly Val Gly 
35 40 45 

Leu Gly Ala Thr Pro Ala Ala Asp Thr Thr Ala Ala Ala Trp Pro Ala 
50 55 60 

Glu Ser His Ala Pro Arg Ala Pro Ser Glu Asp Ala Asp Ser He Tyr 
65 70 75 80 

Glu Ser Val Ser Glu Asp Gly Gly Arg Val Tyr Glu Glu He Pro Trp 
85 - ~ 90 95 

Val Arg Val Tyr Glu Asn He Cys Leu Arg Arg Gin Asp Ala Gly Gly 
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100 105 110 

Ala Ala Pro Pro Gly Asp Ala Pro Asp Ser Pro Tyr lie Glu Ala Glu 
115 120 125 

Asn Pro Leu Tyr Asp Trp Gly Gly Ser Ala Leu Phe Ser Pro Pro Gly 
130 135 140 

Ala Thr Arg Ala Pro Asp Pro Gly Leu Ser Leu Ser Pro Met Pro Ala 
145 150 155 160 

Arg Pro Arg Thr Asn Ala Leu Ala Asn Asp Gly Pro Thr Asn Val Ala 
165 170 175 

Ala Leu Ser Ala Leu Leu Thr Lys Leu Lys Arg Gly Arg His Gin Ser 
180 185 190 

His 



<210> 56 
<211> 15 
<212> PRT 
<213> HSV-2 

<400> 56 

Ser Pro Asn Thr Asp Val Arg Met Tyr Ser Gly Lys Arg Asn Gly 
5 " 10 ^ 15 



<210> 57 
<211> 15 
<212> PRT 
<213> HSV-2 

<4O0> 57 

Tyr Leu Ala Ala Pro Thr Gly lie Pro Pro Ala Phe Phe Pro lie 
5 10 15 



<210> 58 
<211> 15 
<212> PRT 
<213> HSV-2 

<400> 58 

Gly Val Ala Ala Ala Thr Pro Arg Pro Asp Pro Glu Asp Gly Ala 
5 10 15 



<210> 59 
<211> 15 
<212> PRT 
<213> HSV-2 



<400> 59 

Glu Glu lie Pro Trp Val Arg Val Tyr Glu Asn lie Cys Pro Arg 
5 10 15 
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<210> 60 
<211> 15 
<212> PRT 
<213> HSV-2 

<400> 60 

Pro Gly Asp Ala Pro Asp Ser Pro Tyr He Glu Ala Glu Asn Pro 
5 10 15 



<210> 61 
<211> 15 
<212> PRT 
<213> HSV-2 

<400> 61 

Pro Asp Ser Pro Tyr He Glu Ala Glu Asn Pro Leu Tyr Asp Trp 
5 10 15 



<210> 62 
<211> 15 
<212> PRT 
<213> HSV-2 

<400> 62 

Glu Asn Pro Leu Tyr Asp Trp Gly Gly Ser Ala Leu Phe Ser Pro 
5 10 15 



<210> 63 
<211> 15 
<212> PRT 
<213> HSV-2 

<400> 63 

Ala He Asp Tyr Val His Cys Glu Gly He He His Arg Asp He 
5 10 15 



<210> 64 
<211> 15 
<212> PRT 
<213> HSV-2 



<400> 64 

Ala Phe Pro Val Ala Leu His Ala Val Asp Ala Pro Ser Gin Phe 
5 10 15 



